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The following material on traveling-wave tubes is taken from a book which 
will be published by Van Nostrand in September, 1950. Substantially the 
entire contents of the book will be published in this and the three succeeding 
issues of the Bell System Technical Journal. 

This material will cover in detail the theory of traveling-wave amplifiers. In 
addition, brief discussions of magnetron amplifiers and double-stream amplifiers 
are included. Experimental material is drawn on in a general way only, as in- 
dicating the range of validity of the theoretical treatments. 

The material deals only with the high-frequency electronic and circuit be- 
havior of tubes. Such matters as matching into circuits are not considered; 
neither are problems of beam formation and electron focusing, which have been 
dealt with elsewhere.' 

The material opens with the presentation of a simplified theory of the travel- 
ing-wave tube. A discussion of circuits follows, including helix calculations, a 
treatment of filler-type circuits, some general circuit considerations which show 
thal gain will be highest for low group velocities and low stored energies, and a 
justification of a simple transmission line treatment of circuits by mean. of an 
expansion in lerms of the normal modes of propagation of a circuit. Then a de- 
tailed analysis of overall electronic and circuit behavior is made, including a 
discussion of various electronic and circuit waves, the fitting of boundary con- 
ditions to obtain overall gain, noise figure calculations, transverse motions of 
electrons and field solutions appropriate to broad electron streams. Short treat- 
ments of the magnetron amplifier and the double-stream amplifier follow. 


1 For instance, “Theory and Design of Electron Beams,” J. R. Pierce, Van Nostrand, 


1949. 
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CHAPTER I 
INTRODUCTION 


STRONOMERS are interested in stars and galaxies, physicists in 

atoms and crystals, and biologists in cells and tissues because these 

are natural objects which are always with us and which we must under- 

stand. The traveling-wave tube is a constructed complication, and it can 

be of interest only when and as long as it successfully competes with older 

and newer microwave devices. In this relative sense, it is successful and 
hence important. 

This does not mean that the traveling-wave tube is better than other 
microwave tubes in all respects. As yet it is somewhat inefficient compared 
with most magnetrons and even with some klystrons, although efficiencies 
of over 10 per cent have been attained. It seems reasonable that the effi- 
ciency of traveling-wave tubes will improve with time, and a related device, 
the magnetron amplifier, promises high efficiencies. Still, efficiency is not the 
chief merit of the traveling-wave tube. 

Nor is gain, although the traveling-wave tubes have been built with gains 
of over 30 db, gains which are rivaled only by the newer double-stream 
amplifier and perhaps by multi-resonator klystrons. 

In noise figure the traveling-wave tube appears to be superior to other 
microwave devices, and noise figures of around 12 db have been reported. 
This is certainly a very important point in its favor. 

Structurally, the traveling-wave tube is simple, and this too is impor- 
tant. Simplicity of structure has made it possible to build successful ampli- 
fiers for frequencies as high as 48,000 megacycles (6.25 mm). When we con- 
sider that successful traveling-wave tubes have been built for 200 mc, we 
realize that the traveling-wave amplifier covers an enormous range of fre- 
quencies. 

The really vital feature of the traveling-wave tube, however, the new 
feature which makes it different from and superior to earlier devices, is its 
tremendous bandwidth. 

It is comparatively easy to build tubes with a 20 per cent bandwidth at 
4,000 me, that is, with a bandwidth of 800 mc, and L. M. Field has reported 
a bandwidth of 3 to 1 extending from 350 mc to 1,050 mc. There seems no 
reason why even broader bandwidths should not be attained. 

As it happens, there is a current need for more bandwidth in the general 
field of communication. For one thing, the rate of transmission of intelli- 
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gence by telegraph, by telephone or by facsimile is directly proportional 
to bandwidth; and, with an increase in communication in all of these fields, 
more bandwidth is needed. 

Further, new services require much more bandwidth than old services. 
A bandwidth of 4,000 cycles suffices for a telephone conversation. A band- 
width of 15,000 cycles is required for a very-high-fidelity program circuit. 
A single black-and-white television channel occupies a bandwidth of about 
4 mc, or approximately a thousand times the bandwidth required for te- 
lephony. 

Beyond these requirements for greater bandwidth to transmit greater 
amounts of intelligence and to provide new types of service, there is cur- 
rently a third need for more bandwidth. In FM broadcasting, a radio fre- 
quency’ bandwidth of 150 ke is used in transmitting a 15 kc audio channel. 
This ten-fold increase in bandwidth does not represent a waste of frequency 
space, because by using the extra bandwidth a considerable immunity to 
noise and interference is achieved. Other attractive types of modulation, 
such as PCM (pulse code modulation) also make use of wide bandwidths 
in overcoming distortion, noise and interference. 

At present, the media of communication which have been used in the past 
are becoming increasingly crowded. With a bandwidth of about 3 me, 
approximately 600 telephone channels can be transmitted on a single coaxial 
cable. It is very hard to make amplifiers which have the high quality neces- 
sary for single sideband transmission with bandwidths more than a few times 
broader than this. In television there are a number of channels suitable for 
local broadcasting in the range around 100 mc, and amplifiers sufficiently 
broad and of sufficiently good quality to amplify a single television channel 
for a small number of times are available. It is clear, however, that at these 
lower frequencies it would be very difficult to provide a number of long-haul 
television channels and to increase telephone and other services substan- 
tially. 

Fortunately, the microwave spectrum, whch has been exploited increas- 
ingly since the war, provides a great deal of new frequency space. For in- 
stance, the entire broadcast band, which is about 1 me wide, is not sufficient 
for one television signal. The small part of the microwave spectrum in the 
wavelength range from 6 to 7} cm has a frequency range of 1,000 mc, which 
is sufficient to transmit many simultaneous television channels, even when 
broad-band methods such as FM or PCM are used. 

In order fully to exploit the microwave spectrum, it is desirable to have 
amplifiers with bandwidths commensurate with the frequency space avail- 
able. This is partly because one wishes to send a great deal of information 
in the microwave range: a great many telephone channels and a substan- 
tial number of television channels. There is another reason why very broad 
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bands are needed in the microwave range. In providing an integrated nation- 
wide communication service, it is necessary for the signals to be amplified 
by many repeaters. Amplification of the single-sideband type of signal used 
in coaxial systems, or even amplification of amplitude modulated signals, 
requires a freedom from distortion in amplifiers which it seems almost 
impossible to attain at microwave frequencies, and a freedom from inter- 
fering signals which it will be very difficult to attain. For these reasons, it 
seems almost essential to rely on methods of modulation which use a large 
bandwidth in order to overcome both amplifier distortion and also inter- 
ference. 

Many microwave amplifiers are inferior in bandwidth to amplifiers avail- 
able at lower frequencies. Klystrons give perhaps a little less bandwidth than 
good low-frequency pentodes. The type 416A triode, recently developed at 
Bell Telephone Laboratories, gives bandwidths in the 4,000 mc range some- 
what larger than those attainable at lower frequencies. Both the klystron 
and the triode have, however, the same fundamental limitation as do other 
conventional tubes. As the band is broadened at any frequency, the gain is 
necessarily decreased, and for a given tube there is a bandwidth beyond 
which no gain is available. This is so because the signal must be applied by 
means of some sort of resonant circuit across a capacitance at the input of 
the tube. ; 

In the traveling-wave tube, this limitation is overcome completely. There 
is no input capacitance nor any resonant circuit. The tube is a smooth trans- 
mission line with a negative attenuation in the forward direction and a 
positive attenuation in the backward direction. The bandwidth can be 
limited by transducers connecting the circuit of the tube to the source and 
the load, but the bandwidth of such transducers can be made very great. 
The tube itself has a gradual change of gain with frequency, and we have seen 
that this allows a bandwidth of three times and perhaps more. This means 
that bandwidths of more than 1,000 mc are available in the microwave 
range. Such bandwidths are indeed so great that at present we have no means 
for fully exploiting them. 

In all, the traveling-wave tube compares favorably with other microwave 
devices in gain, in noise figure, in simplicity of construction and in fre- 
quency range. While it is not as good as the magnetron in efficiency, reason- 
able efficiencies can be attained and greater efficiencies are to be expected. 
Finally, it does provide amplification over a bandwidth commensurate with 
the frequency space available at microwaves. 

The purpose of this book is to collect and present theoretical material 
which will be useful to those who want to know about, to design or to do 
research on traveling-wave tubes. Some of this material has appeared in 
print. Other parts of the material are new. The old material and the new 
material have been given a common notation. 
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The material covers the radio-frequency aspects of the electronic behavior 
of the tube and its internal circuit behavior. Matters such as matching into 
and out of the slow-wave structures which are described are not considered. 
Neither are problems of producing and focusing electron beams, which 
have been discussed elsewhere,' nor are those of mechanical structure nor of 
heat dissipation. 

In the field covered, an effort has been made to select material of practical 
value, and to present it as understandably as possible. References to vari- 
ous publications cover some of the finer points. The book refers to experi- 
mental data only incidentally in making general evaluations of theoretical 
results. 

To try to present the theory of the traveling-wave tube is difficult with- 
out some reference to the overall picture which the theory is supposed to 
give. One feels in the position of lifting himself by his bootstraps. For this 
reason the following chapter gives a brief general description of the travel- 
ing-wave tube and a brief and specialized analysis of its operation. This 
chapter is intended to give the reader some insight into the nature of the 
problems which are to be met. In Chapters III through VI, slow-wave cir- 
cuits are discussed to give a qualitative and quantitative idea of their na- 
ture and limitations. Then, simplified equations for the overall behavior of 
the tube are introduced and solved, and matters such as overall gain, inser- 


tion of loss, a-c space-charge effects, noise figure, field analysis of operation 
and transverse field operation are considered. A brief discussion of power 
output is given. 


Two final chapters discuss briefly two closely related types of tube; the 
traveling-wave magnetron amplifier and the double-stream amplifier. 


! loc. cit. 
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CHAPTER II 


SIMPLE THEORY OF 
TRAVELING-WAVE TUBE GAIN 


SYNOPSIS OF CHAPTER 


T IS difficult to describe general circuit or electronic features of traveling- 
wave tubes without some picture of a traveling-wave tube and traveling- 
wave gain. In this chapter a typical tube is described, and a simple theoret- 
ical treatment is carried far enough to describe traveling-wave gain in terms 
of an increasing electromagnetic and space-charge wave and to express the 
rate of increase in terms of electronic and circuit parameters. 

In particular, Fig. 2.1 shows a typical traveling-wave tube. The parts of 
this (or of any other traveling-wave tube) which are discussed are the elec- 
tron beam and the slow-wave circuit, represented in Fig. 2.2 by an electron 
beam and a helix. 

In order to derive equations covering this portion of the tube, the proper- 
ties of the helix are simulated by the simple delay line or network of Fig. 2.3, 
and ordinary network equations are applied. The electrons are assumed to 
flow very close to the line, so that all displacement current due to the pres- 
ence of electrons flows directly into the line as an impressed current 

For small signals a wave-type solution of the equations is known to exist, 
in which all a-c electronic and circuit quantities vary with time and dis- 
tance as exp(jwl -- T's). Thus, it is possible to assume this from the start. 

On this basis the excitation of the circuit by a beam current of this form is 
evaluated (equation (2.10)). Conversely, the beam current due to a circuit 
voltage of this form is calculated (equation (2.22)). If these are to be con- 
sistent, the propagation constant I must satisfy a combined equation (2.23). 

The equation for the propagation constant is of the fourth degree in I, 
so that any disturbance of the circuit and electron stream may be expressed 
as a sum of four waves. 

Because some quantities are in practical cases small compared with others, 


it is possible to obtain good values of the roots by making an approximation. 
This reduces the equation to the third degree. The solutions are expressed 
in the form 


—[ = —7B. + BLE 
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Here 8, is a phase constant corresponding to the electron velocity (2.16) 
and C is a gain parameter depending on circuit and beam impedance (2.43). 
A solution of the equation for the case of an electron speed equal to the 
speed of the undisturbed wave yields 3 values of 6 which are shown in Fig. 
2.4. These represent an increasing, a decreasing and an unattenuated 
wave. The increasing wave is of course responsible for the gain of the tube. 
A different approximation yields the missing backward unattenuated wave 
(2.32). 

The characteristic impedance of the forward waves is expressed in terms 
of B., C, and 6 (2.36) and is found to differ little from the impedance in the 
absence of electrons. 

The gain of the increasing wave is expressed in terms of C and the length 
of the tube in wavelengths, V 


G = 47.3.CN db (2.37) 


It will be shown later that the gain of the tube can be expressed approxi- 
mately as the sum of the gain of the increasing wave plus a constant to take 
into account the setting up of the increasing wave, or the boundary condi- 
tions (2.39). 

Finally, the important gain parameter C is discussed. The circuit part of 
this parameter is measured by the cube root of an impedance, (E?/8?P)’, 


which relates the peak field E acting on the electrons, the phase constant 
8 = w/v, and the power flow. (E?/8?P)* is a measure of circuit goodness 
as far as gain is concerned. 

We should note also that a desirable circuit property is constancy of 
phase velocity with frequency, for the electron velocity must be near to the 
circuit phase velocity to produce gain. 

Evaluation of the effects of attenuation, of varying the electron velocity 
and many other matters are treated in later chapters. 


2.1 DESCRIPTION OF A TRAVELING-WAVE TUBE 


Figure 2.1 shows a typical traveling-wave tube such as may be used at 
frequencies around 4,000 megacycles. Such a tube may operate with a 
cathode current of around 10 ma and a beam voltage of around 1500 volts. 
There are two essential parts of a traveling-wave amplifier; one is the helix, ° 
which merely serves as a means for producing a slow electromagnetic wave 
with a longitudinal electric field; and the other is the electron flow. At the 
input the wave is transferred from a wave guide to the helix by means of a 
short antenna and similarly at the output the wave is transferred from the 
helix to a short antenna from which it is radiated into the output wave 
guide. The wave travels along the wire of the helix with approximately the 
speed of light. For operation at 1500 volts, corresponding to about jy the 
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speed of light, the wire in the helix will be about thirteen times as long as the 
axial length of the helix, giving a wave velocity of about ;’, the speed of 
light along the axis of the helix. A longitudinal magnetic focusing field of a 
few hundred gauss may be used to confine the electron beam and enable it 
to pass completely through the helix, which for 4000 megacycle operation 
may be around a foot long. 


OUTPUT 


Y 
COLLECTOR 
' 


Fig. 2.1-—Schematic of the traveling-wave amplifier. 


CLMETRIN oorereeireals 
BEAM REG 


ELECTROMAGNETIC WAVE TRAVELS 
FROM LEFT TO RIGHT ALONG HELIX 


Fig. 2.2—Portion of the traveling-wave amplifier pertaining to electronic interaction 


with radio-frequency fields and radio-frequency gain. 


In analyzing the operation of the traveling-wave tube, it is necessary to 
focus our attention merely on the two essential parts shown in Fig. 2.2, the 
circuit (helix) and the electron stream. 

2.2 Tue Type or ANALYsIS USED 


A mathematical treatment. of the traveling-wave tube is very important, 


not so much to give an exact numerical prediction of operation as to give a 
picture of the operation and to enable one to predict at least qualitatively 
the effect of various physical variations or features. It is unlikely that all of 
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the phenomena in a traveling-wave tube can be satisfactorily described in 
a theory which is simple enough to yield useful results. Most analyses, for 
instance, deal only with the small-signal or linear theory of the traveling- 
wave tube. The distribution of current in the electron beam can have an 
important influence on operation, and yet in an experimental tube it is often 
difficult to tell just what this distribution is. Even the more elaborate analy- 
ses of linear behavior assume a constant current density across the beam. 
Similarly, in most practical traveling-wave tubes, a certain fraction of the 
current is lost on the helix and yet this is not taken into account in the 
usual theories. 

It has been suggested that an absolutely complete theory of the traveling- 
wave tube is almost out of the question. The attack which seems likely to 
yield the best numerical results is that of writing the appropriate partial 
differential equations for the disturbance in the electron stream inside the 
helix and outside of the helix. This attack has been used by Chu and Jackson? 
and by Rydbeck.’ While it enables one to evaluate certain quantities which 
can only be estimated in a simpler theory, the general results do not differ 
qualitatively and are in fair quantitative agreement with those which are 
derived here by a simpler theory. 

In the analysis chosen here, a number of approximations are made at the 
very beginning. This not only simplifies the mathematics but it cuts down 
the number of parameters involved and gives to these parameters a simple 
physical meaning. In terms of the parameters of this simple theory, a great 
many interesting problems concerning noise, attenuation and various bound- 
ary conditions can be worked out. With a more complicated theory, the work- 
ing out of each of these problems would constitute essentially a new problem 
rather than a mere application of various formulae. 

There are certain consequences of a more general treatment of a traveling- 
wave tube which are not apparent in the simple theory presented here. 
Some of these matters will be discussed in Chapters XII, XIII and XIV. 

The theory presented here is a small signal theory. This means that the 
equations governing electron flow have been linearized by neglecting certain 
quantities which become negligible when the signals are small. This results 
in a wave-type solution. Besides the small signal limitation of the analyses 
presented here, the chief simplifying assumption which has been made is 
that all the electrons in the electron flow are acted on by the same a-c field, 
or at least by known fields. The electrons will be acted on by essentially the 
same field when the diameter of the electron beam is small enough or when 


2 L.J. Chu and J. D. Jackson, ‘Field Theory of Traveling-Wave Tubes,” Proc. J. R. E., 
Vol. 36, pp. 853-863, July 1948. 

’ Olof E. H. Rydbeck, “The Theory of the Traveling-Wave Tube,” Ericsson Technics, 
No. 46, 1948. 
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the electrons form a hollow cylindrical beam in an axially symmetrical cir- 
cuit, a case of some practical importance. 

Besides these assumptions, it is assumed in this section that the electrons 
are displaced by the a-c field in the axial direction only. This may be ap- 
proximately true in many cases and is essentially so when a strong magnetic 
focusing field is used. The effects of transverse motion will be discussed in 
Chapter XIII. 

In this chapter an approximate relation suitable for electron speeds small 
compared to the velocity of light is used in computing interaction between 
electrons and the circuit. 

A more general relation between impressed current and circuit field, valid * 
for faster waves, will be given in Chapter VI. Non-relativistic equations of 
motion will, however, be used throughout the book. With whatever speed 
the waves travel, it will be assumed that the electron speed is always small 
compared with the speed of light. 

We consider here the interaction between an electric circuit capable of 
propagating a slow electromagnetic wave and a stream of electrons. We can 
consider that the signal current in the circuit is the result of the disturbed 
electron stream acting on the circuit and we can consider that the disturbance 
on the electron stream is the result of the fields of the circuit acting on the 
electrons. Thus the problem naturally divides itself into two parts. 


2.3 THE FreLp CAUSED BY AN IMPRESSED CURRENT 


We will first consider the problem of the disturbance produced in the 
circuit by a bunched electron stream. In considering this problem in this sec- 
tion in a manner valid for slow waves and small electron velocities, we will 
use the picture in Fig. 2.3. Here we have a circuit or network with uniformly 


7 





Fig. 2.3-—Equivalent circuit of a traveling-wave tube. The distributed inductance 
and capacitance are chosen to match the phase velocity and field strength of the field act- 
ing on the electrons. The impressed current due to the electrons is — 07/0s, where 7 is the 
electron convection current. 


distributed series inductance and shunt capacitance and with current J and 
voltage V. The circuit extends infinitely in the s direction. An electron con- 
vection current 7 flows along very close to the circuit. The sum of the dis- 
placement and convection current into any little volume of the electron 
beam must be zero. Because the convection current varies with distance in 
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the direction of flow, there will be a displacement current J amperes per 
meter impressed on the transmission circuit. We will assume that the elec- 
tron beam is very narrow and very close to the circuit, so that the displace- 
ment current along the stream is negligible compared with that from the 
stream to the circuit. In this case the displacement current to the circuit will 
be given by the rate of change of the convection current with distance. 

If the convection current i and the impressed current J are sinusoidal 
with time, the equations for the network shown in Fig. 2.3 are 


. —jBV + J 
02 

aV on 
Pog jX1 

Here / and V are the current and the voltage in the line, B and X are the 
shunt susceptance and series reactance per unit length and J is the im- 
pressed current per unit length. 

It may be objected that these “network” equations are not valid for a 
transmission circuit operating at high frequencies. Certainly, the electric 
field in such a circuit cannot be described by a scalar electric potential. 
We can, however, choose BX so that the phase velocity of the circuit of 
Fig. 2.3 is the same as that for a particular traveling-wave tube. We can 
further choose XY /B so that, for unit péwer flow, the longitudinal field acting 
on the electrons according to Fig. 2.3,)that is, —dV/ds, is equal to the true 
field for a particular circuit. This lends a plausibility to the use of (2.1) and 
(2.2). The fact that results based on these equations are actually a good ap- 
proximation for phase velocities small compared with the velocity of light 
is established in Chapter VI. 

We will be interested in cases in which all quantities vary with distance 
as exp(—TIs). Under these circumstances, we can replace differentiation 
with respect to s by multiplication by —I°. The impressed current per unit 
length is given by 


Ja -= 
Equations (2.1) and (2.2) become 
—T/ = -—jBV+T% 
—TV = —jX/ 
If we eliminate /, we obtain 


V(t? + BX) = —j0Xi 
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Now, if there were no impressed current, the righthand side of (2.6) would 
be zero and (2.6) would be the usual transmission-line equation. In this case, 
I assumes a value I, , the natural propagation constant of the line, which 
is given by 


T; = jVBX (2.7) 
The forward wave on the line varies with distance as exp(—Iyz) and the 
backward wave as exp(+T\3). 

Another important property of the line itself is the characteristic im- 

pedance K, which is given by 

K = V/X/B (2.8) 
We can express the series reactance X in terms of T; and K 

X = —jkKT; (2.9) 


Here the sign has been chosen to assure that X is positive with the sign 
given in (2.7). In terms of T; and K, (2.6) may be written 


—Tl, Ki 
V = oe 
Gi os T;) (2.10) 


In (2.10), the convection current 7 is assumed to vary sinusoidally with 
time and as exp(—Iz) with distance. This current will produce the voltage 
V in the line. The voltage of the line given by (2.10) also varies sinusoidally 
with time and as exp(—Is) with distance. 


2.4 CONVECTION CURRENT PRODUCED BY THE FIELD 


The other part of thé problem is to find the disturbance produced on the 
electron stream by the fields of the line. In this analysis we will use the 
quantities listed below, all expressed in M.K.S. units.* 


n—charge-to-mass ratio of electrons 
n = 1.759 & 10" coulomb/kg 
uy-—average velocity of electrons 
voltage by which electrons are accelerated to give them the velocity 
Uy. Uy = VW 2nVo 
average electron convection current 
average charge per unit length 
po = —I/uUo 
a-c component of velocity 
a-c component of linear charge density 
a-c component of electron convection current 


* Various physical constants are listed in Appendix I. 
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The quantities v, p, and 7 are assumed to vary with time and distance as 
exp(jwt — Tz). 

One equation we have concerning the motion of the electrons is that the 
time rate of change of velocity is equal to the charge-to-mass ratio times 
the electric gradient. 

d v) aVv 

d(uo + 2) _ eimae (2.11) 

dt Oz 

In (2.11) the derivative represents the change of velocity observed in fol- 
lowing an individual electron. There is, of course, no change in the average 
velocity mu . The change in the a-c component of velocity may be expressed 
in terms of partial derivatives, re , which is the rate of change with time of 
, , ‘ ; : ov a — , 

the velocity of electrons passing a given point, and 3, which is variation of 


electron velocity with distance at a fixed time. 


dv ov Ov dz OV 


a lO > eae ie 


Equation (2.12) may be rewritten 


son + ~ (uo + 2) n— (2.13) 


al 


~ 


Now it will be assumed that the a-c velocity v is very small compared with 
the average velocity uo, and v will be neglected in the parentheses. The reason 
for doing this is to obtain differential equations which are linear, that is, 
in which products of a-c terms do not appear. Such linear equations neces- 
sarily give a wave type of variation with time and distance, such as we 
have assumed. The justification for neglecting products of a-c terms is that 
we are interested in the behavior of traveling-wave tubes at small signal 
levels, and that it is very difficult to handle the non-linear equations. When 
we have linearized (2.13) we may replace the differentiation with a respect 
to time by multiplication by jw and differentiation with respect to distance 
by multiplication by —I and obtain 


(jw ae Uol')v = —lV (2.14) 
We can solve (2.14) for the a-c velocity and obtain 
—nlV 
to( jBe — V) 


Be = w/Uo 
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We may think of 8, as the phase constant of a disturbance traveling with 
the electron velocity. 

We have another equation to work with, a relation which is sometimes 
called the equation of continuity and sometimes the equation of conserva- 
tion of charge. If the convection current changes with distance, charge 
must accumulate or decrease in any small elementary distance, and we see 
that in one dimension the relation obeyed must be 


01 Op 
-_= —— (2.17 
Oz at ) 

Again we may proceed as before and solve for the a-c charge density p 


—Ti = —jwp 


~jTi (2.18) 


@ 


p= 

The total convection current is the total velocity times the total charge 
density 

—Io +i = (uo + v)(po + p) (2.19) 


Again we will linearize this equation by neglecting products of a-c quanti- 
ties in comparison with products of a-c quantities and a d-c quantity. This 
gives us 


1 = pov + Mop (2.20) 


We can now substitute the value p obtained from (2.18) into (2.20) and solve 
for the convection current in terms of the velocity, obtaining 


Be pov 
[oe (2.21) 
Using (2.15) which gives the velocity in terms of the voltage, we obtain 
the convection current in terms of the voltage 


_jloBPV 
2Vo(j8e — 1)? 


(2.22) 


2.5 OVERALL CIRCUIT AND ELECTRONIC EQUATION 


In (2.22) we have the convection current in terms of the voltage. In (2.10) 
we have the voltage in terms of the convection current. Any value of I for 
which both of these equations are satisfied represents a natural mode of 
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propagation along the circuit and the electron stream. When we combine 
(2.22) and (2.10) we obtain as the equation which I’ must satisfy: 


ee a (2.23) 
2Vo(0; — I”)(78e — T) 


Equation (2.23) applies for any electron velocity, specified by 8,, and any 
wave velocity and attenuation, specified by the imaginary and real parts of 
the circuit propagation constant I; . Equation (2.23) is of the fourth degree. 
This means that it will yield four values of ! which represent four natural 
modes of propagation along the electron stream and the circuit. The circuit 
alone would have two modes of propagation, and this is consistent with the 
fact that the voltages at the two ends can be specified independently, and 
hence two boundary conditions must be satisfied. Four boundary conditions 
must be satisfied with the combination of circuit and electron stream. These 
may be taken as the voltages at the two ends of the helix and the a-c velocity 
and a-c convection current of the electron stream at the point where the 
electrons are injected. The four modes of propagation or the waves given by 
(2.23) enable us to satisfy these boundary conditions. 

We are particularly interested in a wave in the direction of electron flow 
which has about the electron speed and which will account for the observed 
gain of the traveling-wave tube. Let us assume that the electron speed is 
made equal to the speed of the wave in the absence of electrons, so that 


—T; = —jp. (2.24) 
As we are looking for a wave with about the electron speed, we will assume 
that the propagation constant differs from 8, by a small amount &, so that 
—P = —j6. + § 
| T+ 
Using (2.24) and (2.25) we will rewrite (2.23) as 


, 2 2 . 2 
—KIpB.(—B. — 278. + &) nae 
ls — 5 (2.26) 
2V0(2j8.— — §)(E*) 

Now we will find that, for typical traveling-wave tubes, — is much smaller 
than 8, ; hence we will neglect the terms involving 6,£ and & in the numera- 
tor in comparison with 8,’ and we will neglect the term £ in the denominator 

in comparison with the term involving 8,.£. This gives us 


3 KI 


e y 2.27 
av, (2.27) 


f= —j6 


While (2.27) may seem simple enough, it will later be found very convenient 
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to rewrite it in terms of other parameters, and we will introduce them 
now. Let 

KIo/4Vo med Cc (2.28) 


C is usually quite small and is typically often around .02. Instead of & we 
will use a quantity or a parameter 6 


— = B.C6 (2.29) 
In terms of 6 and C, (2.27) becomes 
6 = (=f) = (formes (2,30) 


This has three roots which will be called 6, , 62 and 6; , and these represent 
three forward waves. They are 


hb =e" = 3/2 — 5/2 
b& = = —4/3/2 — 7/2 (2.31) 
jx/2 


63 = e€ =] 


Figure 2.4 shows the three values of 6. Equation (2.23) was of the fourth 
degree, and we see that a wave is missing. The missing root was eliminated 


0.866 -j 0.5 


Fig. 2.4—There are three forward waves, with fields which vary with distance as 
exp(—jB8. + B-Cd)z. The three values of 6 for the case discussed, in which the circuit is 
lossless and the electrons move with the phase velocity of the unperturbed circuit wave, 
are shown in the figure. 


by the approximations made above, which are valid for forward waves only. 
The other wave is a backward wave and its propagation constant is found 


to be 


—T = jp, (1 _ S) (2.32) 


As C is a small quantity, C* is even smaller, and indeed the backward wave 
given by (2.32) is practically the same as the backward wave in the absence 
of electrons. This is to be expected. In the forward direction, there is a cumu- 
lative interaction between wave and the electrons because both are moving 
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at about the same speed. In the backward direction there is no cumulative 
action, because the wave and the electrons are moving in the opposite 
directions. 

The variation in the z direction for three forward waves is as 


exp --I's = exp —j§.2 exp 6CB,5 (2.33) 


We see that the first wave is an increasing wave which travels a little more 
slowly than the electrons. The second wave is a decreasing wave which 
travels a little more slowly than the electrons. The third wave is an un- 
attenuated wave which travels faster than the electrons. It can be shown 
generally that when a stream of electrons interacts with a wave, the electrons 
must go faster than the wave in order to give energy to it. 

It is interesting to know the ratio of line voltage to line current, or the 
characteristic impedance, for the three forward waves. This may be obtained 
from (2.5). We see that the characteristic impedance K,, for the nth wave is 
given in terms for the propagation constant for the nth wave, Tl’, by 


K, = V/I = jX/T, (2.34) 
In terms of 6, this becomes 

K, = KU — 8.C4,/T)) (2.35) 

K, = K(i — jCé,) (2.36) 


We see that the characteristic impedance for the forward waves differs from 
the characteristic impedance in the absence of electrons by a small amount 
proportional to C, and that the characteristic impedance has a small reactive 
component. 

We are particularly interested in the rate at which the increasing wave 
increases. In a number of wave lengths .V, the total increase in db is given by 


20 logio exp [(4/3/2)(C)(24N)] db 
= 47.3CN db 


(2.37) 


We will see later that the overall gain of the traveling-wave tube with a 
uniform helix can be expressed in the form 


G = A + BCN db (2.38) 


Here A is a loss relating voltage associated with the increasing wave to 
the total applied voltage. This loss may be evaluated and will be evaluated 
later by a proper examination of the boundary conditions at the input of 
the tube. It turns out that for the case we have considered 


G = —9.54 + 47.3 CN db (2.39) 
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In considering circuits for traveling-wave tubes, and in reformulating 
the theory in more general terms later on, it is valuable to express C in terms 
of parameters other than the characteristic impedance. Two physically sig- 
nificant parameters are the power flow in the circuit and the electric field 
associated with it which acts on the electron stream. The ratio of the square 
of the electric field to the power can be evaluated by physical measurement 
even when it cannot be calculated. For instance, Cutler’ did this by allowing 
the power from a wave guide to flow into a terminated helix, so that the 
power in the helix was the same as the power in the wave guide. He then 
compared the field in the helix with the field in the wave guide by probe 
measurements. The field strength in the wave guide could be calculated in 
terms of the power flow, and hence Cutler’s measurements enabled him to 
evaluate the field in the helix for a given power flow. 

The magnitude of the field is given in terms of the magnitude of the 
voltage by 


E=|rv| (2.40) 


Here E is taken as the magnitude of the field. The power flow in the circuit 
is given in terms of the circuit voltage by 


P = |V /?/2K (2.41) 
A quantity which we will use as a circuit parameter is 
E?/@P = 2K (2.42) 


Here it has been assumed that we are concerned with low-loss circuits, so 
that Ij can be replaced by the phase constant 6%. Usually, 8 can be taken 
as equal to 6,, the electron phase constant, with small error, and in the 
preceding work this has been assumed to be exactly true in (2.23). 

In terms of this new quantity, C is given by 


C3 = (2K)(Io/8Vo) = (E?/8?P)(1o/8V 0) (2.43) 


If we call Vo/J» the beam impedance, C* is } the circuit impedance divided 
by the beam im, edance. It would have been more sensible to use E?/26?P 
instead of E*/6?P. Unfortunately the writer feels stuck with his benighted 
first choice because of the number of curves and published equations which 
make use of it. 

Besides the circuit impedance, another important circuit parameter is 
the phase velocity. As the electron velocity is made to deviate from the 
phase velocity of the circuit, the gain falls off. An analysis to be given later 


>C. C. Cutler, “Experimental Determination of Helical-Wave Properties,’ Proc. 1 RE, 
Vol. 36, pp. 230-233, February 1948. 
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discloses that the allowable range of velocity Av is of the order of 


Av Y + Cuo (2.44) 


Thus, the allowable difference between the phase velocity of the circuit and 
the velocity of the electrons increases as circuit impedance and beam current 
are increased and decreases as voltage is increased. 

We have illustrated the general method of attack to be used and have 
introduced some of the important parameters concerned with the circuit 
and with the overall behavior of the tube. In later chapters, the properties 
of various circuits suitable for traveling-wave tubes will be discussed in 
terms of impedance and phase velocity and various cases of interest will be 
worked out by the methods presented. 
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CHAPTER III 
THE HELIX 
SYNOPSIS OF CHAPTER 


NY circuit capable of propagating a slow electromagnetic wave can be 

used in a traveling-wave tube. The circuit most often used is the helix. 

The helix is easy to construct. In addition, it is a very good circuit. It has a 

high impedance and a phase velocity that is almost constant over a wide 
frequency range. 

In this chapter various properties of helices are discussed. An approximate 
expression for helix properties can be obtained by calculating the properties, 
not of a helix, but of a helically conducting cylindrical sheet of the same 
radius and pitch as the helix. An analysis of such a sheet is carried out in 
Appendix II and the results are discussed in the text. 

Parameters which enter into the expressions are the free-space phase con- 
stant By = w/c, the axial phase constant 8 = w/v, where v is the phase 
velocity of the wave, and the radial phase constant y. The arguments of 
various Bessel functions are, for instance, yr and ya, where r is the radial 
coordinate and a is radius of the helix. The parameters 6, 8 and y are 
related by 


B= B+ 7 


For tightly wound helices in which the phase velocity v is small compared 
with the velocity of light, y is very nearly equal to 8. For instance, at a 
velocity corresponding to that of 1,000 volt electrons, y and 8 differ by 
only 0.4%. 

Figure 3.1 illustrates two parameters of the helically conducting sheet, 
the radius a and pitch angle y. For an actual helix, @ will be taken to mean 
the mean radius, the radius to the center of the wire. 

Figure 3.2 shows a single curve which enables one to obtain y, and hence 
8, for any value of the parameter 


wa cot y 


Boa coty = 


This parameter is proportional to frequency. The curve is an approximate 
representation of velocity vs. frequency. At high frequencies y approaches 
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Bo cot y and 6 thus approaches 8)/sin y; this means that the wave travels 
with the velocity of light around the sheet in the direction of conduction. 
In the case of an actual helix, the wave travels along the wire with the 
velocity of light. 

The gain parameter C is given by 


C = (Io/8Vo)'""(E*/8°P)"” 


Values of (E?/?P)"* on the axis may be obtained through the use of Fig. 3.4, 
where an impedance parameter F (ya) is plotted vs. ya, and by use of (3.9). 
For a given helix, (E?/?P)"* is approximately proportional to F(ya). F(ya) 
falls as frequency increases. This is partly because at high frequencies and 
short wavelengths, for which the sign of the field alternates rapidly with 
distance, the field is strong near the helix but falls off rapidly away from the 
helix and so the field is weak near the axis. At very high frequencies the field 
falls off away from the helix approximately as exp(—~yAr), where Ar is dis- 
tance from the helix, and we remember that y is very nearly proportional to 
frequency. (E£?/8?P)"* measured at the helix also falls with increasing 
frequency. 

In many cases, a hollow beam of radius r (the dashed lines of Fig. 3.5 
refer to such a beam) or a solid beam of radius r (the solid lines of Fig. 3.5 
refer to such a beam) is used. For a hollow beam we should evaluate E? in 
(E2/g?P)"* at the beam radius, and for a solid beam we should use the mean 
square value of E averaged over the beam. 

The ordinate in Fig. 3.5 is a factor by which (£*/6?P)'? as obtained from 
Fig. 3.4 and (3.9) should be multiplied to give (£°/6?P)'’* for a hollow or 
solid beam. 

The gain of the increasing wave is proportional to F(ya) times a factor 
from Fig. 3.5, and times the length of the tube in wavelengths, \V. NV is very 
nearly proportional to frequency. Also y, and hence ya, are nearly propor- 
tional to frequency. Thus, F (ya) from Fig. 3.4 times the appropriate factor 
from Fig. 3.5 times ya gives approximately the gain vs. frequency, (if we 
assume that the electron speed matches the phase velocity over the fre- 
quency range). This product is plotted in Fig. 3.6. We see that for a given 
helix size the maximum gain occurs at a higher frequency and the band- 
width is broader as r/a, the ratio of the beam radius to the helix radius, 
is made larger. 

It is usually desirable, especially at very short wavelengths, to make the 
helix as large as possible. If we wish to design the tube so that gain is a maxi- 
mum at the operating frequency, we will choose a so that the appropriate 
curve of Fig. 3.6 has its maximum at the value of ya corresponding to the 
operating frequency. We see that this value of a will be larger the larger is 
r/a. In an actual helix, the maximum possible value of r/a is less than unity, 
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since the inside diameter of the helix is less than a by the radius of the wire. 
Further, focusing difficulties preclude attaining a beam radius equal even to 
the inside radius of the helix. 

Experience indicates that at very short wavelengths (around 6 milli- 
meters, say) it is extremely important to have a well-focused electron beam 
with as large a value of r/a as is attainable. 

A characteristic impedance K, may be defined in terms of a ‘‘transverse”’ 
voltage V;, obtained by integrating the peak radial field from a to ~, and 
from the power flow. In Fig. 3.7, (v/c) Ky is plotted vs. ya. A “longitudinal” 
characteristic impedance K¢ is related to K, (3.13). For slow waves K¢ 
is nearly equal to K,. The impedance parameter E*/8°P evaluated at the 
surface of the cylinder is twice K¢. We see that K¢ falls with increasing 
frequency. 

A simplified approach in analysis of the helically conducting sheet is that 
of “developing” the sheet; that is, slitting it normal to the direction of con- 
duction and flattening it out as in Tig. 3.8. The field equations for such a 
flattened sheet are then solved. For large values of ya the field is concentrated 
near the helically conducting sheet, and the fields near the developed sheet 
are similar to the fields near the cylindrical sheet. Thus the dashed line 
in Fig. 3.7 is for the developed sheet and the solid line is for a cylindrical 
sheet. 

For the developed sheet, the wave always propagates with the speed of 
light in the direction of conduction. In a plane normal to the direction of 
conduction, the field may be specified by a potential satisfying Laplace’s 
equation, as in the case, for instance, of a two-wire or coaxial line. Thus, 
the fields can be obtained by the solution of an electrostatic problem. 

One can develop not only a helically conducting sheet, but an actual 
helix, giving a series of straight wires, shown in cross-section in Fig. 3.9. 
In Case I, corresponding to approximately two turns per wavelength, suc- 
cessive wires are —, +, —, + etc.; in case II, corresponding to approxi- 
mately four turns per wavelength, successive wires are +, 0, —, 0, +, 0 etc. 

Figures 3.10 and 3.11 illustrate voltages along a developed sheet and a 
developed helix. 

Figure 3.13 shows the ratio, R'*, of (4°/8°?P)'/* on the axis to that for a 
developed helically conducting sheet, plotted vs. d/p. We see that, for a 
large wire diameter d, (?/8??)!'/* may be larger on the axis than for a heli- 
cally conducting sheet with the same mean radius and hence the same pitch 
angle and phase velocity. This is merely because the thick wires extend nearer 
to the axis than does the sheet. The actual helix is really inferior to the 
sheet. 

We see this by noting that the highest value of (E?/s?P)'/8 for a helically 
conducting sheet is that at the sheet (r = a). With a finite wire size, the 
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largest value r can have is the mean helix radius @ minus the wire radius. 
In Fig. 3.14, the ratio of (E°/8°P)'* for this largest allowable radius to 
(E*/p°P)'’3 at the surface of the developed sheet is plotted vs. d/p. We see 
that, in terms of maximum available field, (£?/8°P)'/* is no more than 0.83 as 
high as for the sheet for four turns per wavelength and 0.67 as high as for the 
sheet for two turns per wavelength. We further see that there is an optimum 
ratio of wire diameter to pitch; about 0.175 for four turns per wavelength 
and about 0.125 for two turns per wavelength. Because the maxima are so 
broad, it is probably better in practice to use larger wire, and in most tubes 
which have been built, d/p has been around 0.5. 

In designing tubes it is perhaps best to do so in terms of field on the axis 
(Fig. 3.13), the allowable value of r/a and the curves of Fig. 3.6. 

Figure 3.15 compares the impedance of the developed helix with that of 
the developed sheet as given by the straight line of Fig. 3.7. 

There are factors other than wire size which can cause the value of E?/6?P 
for an actual helix to be less than the value for the helically conducting 
sheet. An important cause of impedance reduction is the influence of di- 
electric supporting members. Even small ceramic or glass supporting rods 
can cause some reduction in helix impedance. In some tubes the helix is 
supported inside a glass tube, and this can cause a considerable reduction 
in helix impedance. 

When a field analysis seems too involved, it may be possible to obtain 
some information by considering the behavior of transmission lines having 
parameters adjusted to make the phase constant and the character'stic im- 
pedance equal to those of the helix. For instance, suppose that the presence 
of dielectric material results in an actual phase constant 84 as opposed to a 
computed phase constant 3. Equation (3.64) gives an estimate of the con- 
sequent reduction of (£?/8°P)'® on the axis. 

This method is of use in studying the behavior of coupled helices. For 
instance, concentric helices may be useful in producing radial fields in tubes 
in which transverse fields predominate in the region of electron flow (see 
Chapter XIII). A concentric helix structure might be investigated by means 
of a field analysis, but some interesting properties can be deduced more 
simply by considering two transmission lines with uniformly distributed self 
and mutual capacitances and inductances, or susceptance and reactances. 
The modes of propagation on such lines are affected by coupling in a manner 
similar to that in which the modes of two resonant circuits are affected by 
coupling. 

If two lines are coupled, their two independent modes of propagation are 
mixed up to form two modes of propagation in which both lines participate. 
If the original phase velocities differ greatly, or if the coupling between the 
lines is weak, the fields and velocity of one of these modes will be almost 
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like the original fields and velocity of one line, and the fields and velocity of 
the other mode will be almost like the original fields and velocity of the other 
line. However, if the coupling is strong enough compared with the original 
separation of phase velocities, both lines will participate almost equally in 
each mode. One mode will be a ‘“‘longitudinal mode”’ for which the excitations 
on the two lines are substantially equal, and the other mode will be a “‘trans- 
verse”’ mode for which the excitations are substantially equal and opposite. 

The ratios of the voltages on the lines for the two modes are given by 
(3.75). Here it is assumed that the series reactances X and shunt susceptances 
B of the lines are almost equal, differing only enough to make a difference 
AT in the propagation constants. By, and Xj. are the mutual susceptance 
and reactance. We see that to make the voltages on the two lines nearly 
equal or equal and opposite, B;2 and Xj. should have the same sign, so that 
capacitive and inductive couplings add. 


XN 


Fig. 3.1—A helically conducting sheet of radius a. The sheet is conducting along helical 
paths making an angle y with a plane normal to the axis. 


Increasing the coupling increases the velocity separation between the two 
modes, and this is desirable. When there is a substantial difference in ve- 
locity, operation in the desired mode can be secured by making the electron 
velocity equal to the phase velocity of the desired mode. 

To make the capacitive and inductive couplings add in the case of con- 
centric helices (Fig. 3.17), the helices should be wound in opposite directions. 


3.1 THe HELICALLY CONDUCTING SHEET 


In computing the properties of a helix, the actual helix is usually replaced 
by a helically conducting cylindrical sheet of the same mean radius. Such a 
sheet is illustrated in Fig. 3.1. This sheet is perfectly conducting in a helical 
direction making an angle y, the pitch angle, with a plane normal to the 
axis (the direction of propagation), and is non-conducting in a helical direction 
normal to this y direction, the direction of conduction. Appropriate solutions 
of Maxwell’s equations are chosen inside and outside of the cylindrical sheet. 
At the sheet, the components of the electric field in the y direction are made 
zero, and those normal to the y direction are made equal inside and outside. 
Since there can be no current in the sheet normal to the y direction, the 
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components of magnetic field in the y direction must be the same inside and 
outside of the sheet. When these boundary conditions are imposed, one can 
solve for the propagation constant and E*/6?P can then be obtained by 
integrating the Poynting vector. 

The helically conducting sheet is treated mathematically in Appendix IT. 
The results of this analysis will be presented here. 

















| 

















3 
Bo 8 COTY 
Fig. 3.2—-The radial propagation constant isy = (# — B3)'/2, Here (Bo/y) cot y is 
plotted vs Boa cot y, a quantity proportional to frequency. For slow waves the ordinate is 
roughly the ratio of the wave velocity to the velocity the wave would have if it traveled 
along the helically conducting sheet with the speed of light in the direction of conduction 


3.1a The Phase Velocity 


The results for the helically conducting sheet are expressed in terms of 
three phase or propagation constants. These are 


Bo = w/C, B= w/v (3.1) 


1=Ve- pg (3.2) 
y = BV1 — (/c)? (3.3) 


Here c is the velocity of light and v is the phase velocity of the wave. By is 
the phase constant of a wave traveling with the speed of light, which would 
vary with distance in the z direction as exp(—j@oz). The actual axial phase 
constant is 8, and the fields vary with distance as exp(—j@z). 

y is the radial propagation constant. Various field components vary as 
modified Bessel functions of argument yr, where r is the radius. Particularly, 
the longitudinal electric field, which interacts with the electrons, varies 
as Io(yr). 

For the phase velocities usually used, y is very nearly equal to 8, as may 
be seen from the following table of accelerating voltages V» (to give an elec- 
tron the velocity v), o/c and y/8. 
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V v/¢ ¥/B 
100 .0198 1.000 
1,000 0625 998 
10 ,000 . 1980 .980 


Figure 3.2 gives information concerning the phase velocity of the wave 
in the form of a plot of (8/y) cot y as a function of Bp a cot yp. 
The ratio of the phase velocity v to the velocity of light cmay be expressed 


Bo/B = (y/B)(Bo/y) cot y tan y (3.4) 
(y/B) tan W [(Bo/y) cot y ] 
































BoacoTy 

Fig. 3.3-——From these curves one can obtain v/c, the ratio of the phase velocity of the 
wave to the velocity of light, for various values of tan y and Aya cot y. 
From Fig. 3.2 we see that, for large values of Boa cot y, (80/7) cot p ap- 
proaches unity. For slow waves y/8 approaches unity. Under these circum- 
stances, very nearly 

v/¢c = tany (3.5) 

If the wave traveled in the direction of conduction with the speed of light 
we would have 


= sin yp 
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: 
This is essentially the same as (3.5) for small pitch angles p. Thus, for large 
values of the abscissa in Fig. 3.2, the phase velocity is just about that corre- 
sponding to propagation along the sheet in the direction of conduction with 
the speed of light and hence in the axial direction at a much reduced speed. 
For helices of smaller radius compared with the wavelength, the speed is 
greater. 

The bandwidth of a traveling-wave tube is in part determined by the 
range over which the electrons keep in step with the wave. The abscissa of 
Fig. 3.2 is proportional to frequency, but the ordinate is not strictly propor- 
tional to phase velocity. Hence, it seems desirable to have a plot which does 
show velocity directly. To obtain this we can assign various values to cot y. 





F (7a) = 7154e70.666478 

















7a 
Fig. 3.4—A curve giving the impedance function F(ya) vs. ya. On the axis, (#?7/8°P)' = 
(B/By)'3(y¥/B)*3F (ya). 
The ordinate (8)/y) cot y then gives us y/8» and from (3.2) we see that 


v/¢ = Bo/B = (1 + (y/Bo)*)7” (3.6) 


We have seen that, for large values of Soa cot y, (80/7) cot ¥ approaches 
unity, and v/c approaches a value 


v/e = (1+ cot*y) ‘” = siny (3.7) 


To emphasize the change in velocity with frequency it seems best to plot the 
difference between the actual velocity ratio and this asymptotic velocity 
ratio on a semi-log scale. Accordingly, Fig. 3.3 shows (v/c)—sin p vs. Boa 
cot y for tan y = .05, .075, .1, .15, .2. 

For large values of the abscissa the velocities are those corresponding to 
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about 640 volts (tan ¥ = .05), 1,400 volts (.075), 2,500 volts (.1), 5,600 volts 
(.15), 9,800 volts (.2). 
3.1b The Impedance Parameter (E°/6'P) 


Figure 3.4 shows a plot of a quantity (ya) vs. ya. This quantity is com- 
puted from a very complicated expression (Appendix IT), but it is accurately 
given over the range shown by the empirical relation 


F(ya) = 7.154 « (3.8) 
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Fig. 3.5— Factors by which (#78/?P)"/3 on the axis should be multiplied to give the cor 
rect value for hollow and solid beams of radius r. 


For the field on the axis of the helix, 


(E?/B?P)'® = (B/Bo)'(y/B)*3F (ya) (3.9) 


We should remember that 8/89 = c/v and that y/@ is nearly unity for veloci- 
ties small compared with the velocity of light. 

In the expression for the gain parameter C, the square of the field E is 
multiplied by the current J, (2.28). If we were to assume that two electron 
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streams of different currents, J; and J; , were coupled to the circuit through 
transformers, so as to be acted on by fields E; and £2, but that the streams 
did not interact directly with one another, we would find the effective value 
of C* to be given by 


C* = (Ei/8’P)(1,/8Vo) + (E2/B°P)(12/8V 0) 


Thus, if we neglect the direct interaction of electron streams through fields 
due to local space charge, we can obtain an effective value of C* by integrat- 
ing E*dJo over the beam. If we assume a constant current density, we can 
merely use the mean square value of E over the area occupied by electron 
flow. 

The axial component of electric field at a distance r from the axis is /(yr) 
times the field on the axis. Hence, if we used a tubular beam of radius r, we 
should multiply (Z2/8?P)'” as obtained from Fig. 3.4 by| Jo(yr)|?°. The quan- 
tity [Zo(yr)]?/* is plotted vs. ya for several values of r/a as the dashed lines 
in Fig. 3.5. 

Suppose the current density is uniform out to a radius r and zero beyond 
this radius. The average value of E* is greater than the value on the axis by 
a factor [/2(yr) — 1; (yr)| and (E2/8°P)'* from Fig. 3.4 should in this case 
be multiplied by this factor to the } power. The appropriate factor is plotted 
vs. ya as the solid lines of Fig. 3.5. 

We note from (2.39) that the gain contains a term proportional to C.V, 
where JN is the number of wavelengths. For slow waves and usual values of 
ya, very nearly, NV will be proportional to the frequency and hence to y, 
while C is proportional to (E?/6?P)'/*. We can obtain (£°/8?P)'* from Figs. 
3.4 and 3.5. The gain of the increasing wave as a function of frequency will 
thus be very nearly proportional to this value of (£?/8?P)'/* times y, or, 
times ya if we prefer. 

In Fig. 3.6, yaF (ya) is plotted vs. ya for hollow beams of radius r for 
various values of r/a (dashed lines) and for uniform density beams of 
radius r for various values of r/a (solid lines). If we assume that the electron 
speed is adjusted to equal the phase velocity of the wave, we can take the 
ordinate as proportional to gain and the abscissa as proportional to 
frequency. 

We see that the larger is r/a, the larger is the value of ya, for maximum 
gain. For one typical 7.5 cm wavelength traveling-wave tube, ya was about 
2.8. For this tube, the ratio of the inside radius of the helix to the mean radius 
of the helix was 0.87. We see from Fig. 3.6 that, if a solid beam just filled 
this helix, the maximum gain should occur at about the operating wave- 
length. As a matter of fact, the beam was somewhat smaller than the inside 
diameter of the helix, and there was an observed increase of gain with an 
increase in wavelength (a higher gain at a lower frequency). In a particular 
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tube for 0.625 cm wavelength, it was felt desirable to use a relatively large 
helix diameter. Accordingly, a value of ya of 6.7 was chosen. We see that, 
unless r/a is 0.9 or larger, this must result in an appreciable increase in gain 
at some frequency lower than operating frequency. It was only by use of 
great care in focusing the beam that gain was attained at 0.625 cm wave- 
length, and there was a tendency toward oscillation, presumably at longer 
wavelengths. This discussion of course neglects the effect of transmission 
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Fig. 3.6—-The ordinate is yaF (ya) times the parameters from Fig. 3.5. For a fixed cur 
rent and voltage it is nearly proportional to gain per unit length, and hence the curves 
give roughly the variation of gain with frequency. 


loss or gain. Usually the loss decreases when the frequency is decreased, 
and this favors oscillation at low frequencies. 


3.1¢ Impedance of the Helix 


No impedance which can be assigned to the helically conducting sheet 
can give full information for matching a helix to a waveguide or transmission 
line. As in the case of transducers between a coaxial line and a waveguide or 
between waveguides of different cross-section, the impedance is important, 
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but discontinuity effects are also important. However, a suitably defined 
helix impedance is of some interest. 

Figure 3.7 presents the impedance as defined on a voltage-power basis. 
The peak “‘transverse” voltage V, is obtained by integrating the radial elec- 
tric field from the radius a of the helically conducting sheet to ©. The 
“‘transverse”’ characteristic impedance K, is defined by the relation 


= (4)(V%/K,) 


{ } 
04 06 081.0 3 45678 10 20 630 
7a 


Fig. 3.7—Curves giving the variation of transverse impedance, A, , with ya. 


The impedance is found to be given by 


BY’ (Bo\ ,- _ 12075 
(2) (*) x. af (ya)? [(1+4 + i, K mY a - To 12) 
Io I,K = Bi ! 
. (=) ( + TK, ‘) (KoK> KD | 


The /’s and A’s are modified Bessel functions of argument ya. 
The dashed line on Fig. 3.7 is a plot of 30/ya vs. ya. It may be seen that, 
for large values of ya, very nearly 


(3.10) 


K, = (8/Bo)(v/B)*(30/ya) (3.11) 
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and in the whole range shown the impedance differs from this value by a 
factor less than 1.5. 

We might have defined a “longitudinal” voltage V¢ as half of the integral 
of the longitudinal component of electric field at the surface of the helically 
conducting sheet for a half wavelength (between successive points of zero 
field). We find that 


Ve= V1 — (0/c)? Vi = (y/B)Vi (3.12) 
and, accordingly, the “longitudinal impedance” K¢ will be 

Ke = [1 — (0/c)"|Ki = (7/8)*K. (3.13) 
Our impedance parameter, F°/8°P, is just twice this ‘longitudinal 
impedance.” 
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éFig. 3.8—A “developed” helically conducting sheet. The sheet has been slit along a 
line normal to the direction of conduction and flattened out. 
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The transverse voltage V, is greater than the longitudinal voltage V¢ 
because of the circumferential magnetic flux outside of the helix. For slow 
waves V’¢ is nearly equal to V, and the fields are nearly curl-free solutions of 
Laplace’s equation. In this case the circumferential magnetic flux is small 
compared with the longitudinal flux inside of the helix. 

For the circuit of Fig. 2.3 the transverse and longitudinal voltages are 
equal, and it is interesting to note that this is approximately true for slow 
waves on a helix. For very fast waves, the longitudinal voltage becomes small 
compared with the transverse voltage. 

For a typical 4,000-megacycle tube, for which ya = 2.8, Fig. 5 indicates a 
value of AK, of about 150 ohms. 


3.2 Tuk DEVELOPED HELIX 


For large helices, i.e., for large values of ya, the fields fall off very rapidly 
away from the wire. Under these circumstances we can obtain quite accurate 
results by slitting the helically conducting sheet along a spiral line normal 
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to the direction of conduction and flattening it out. This gives us the plane 
conducting sheet shown in Fig. 3.8. The indicated coordinates are z to the 
right and y upward: «x is positive into the paper. The fields about the de- 
veloped sheet approximate those about the helically conducting sheet for 
distances always small compared with the original radius of curvature. 

The straight dashed line shown on the helix impedance curve of Fig. 3.7 
can be obtained as a solution for the “developed helix.’’ We see that it is 
within 10% of the true curve for values of ya greater than 2.8. We might note 
that a 10% error in impedance means only a 34% error in the gain 
parameter C. 

In solving for the fields around the sheet, the developed surface can be 
extended indefinitely in the plus and minus y directions. In‘order that the 
fields may match when the sheet is rolled up, they must be the same at 
y = 0,2 = 2rasiny and y = 27a cos y, 3 = 0. The appropriate solutions 
are plane electromagnetic waves traveling in the y direction with the speed 
of light. 

For positive values of x, the appropriate electric and magnetic fields are 


Ez = Ee ™ ev" e Pow 
E, = jE & 3 & *™ (3.14) 
E, = 0 
We should note that the x and z components of the field can be obtained 
as gradients of a function 
® = —(Ep/y)e* & 3 (3.15) 
where 
— 0’/ dz 
E, = —0/dy 
Pb/dx? + Ab/dz? = 0 (3.17) 


(3.16) 


Thus, in the «z plane, ® satisfies Laplace’s equation. 

— . - . . 6 . . ° ‘ 

rhe magnetic field is given by the curl’ of the electric field times j/wy. 
Its components are: 


} yp —v2 578 38 
H, d Zoe Ye J78 6 IP oN 


H, come Eo ete im e Fou 


Hy, 


5 Maxwell’s equations are given in Appendix I. 
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The fields in the —x direction may be obtained by substituting exp(yx) 
for exp(—yx). 

If the sheet is to roll up properly, the points a on the bottom coinciding 
with the points 6 on the top, we have 


2rya sin Y — 2mBoa cos WY = 2nr (3.19) 


where # is an integer. 

The solution corresponding most nearly to the wave on a singly-wound 
helix is that for m = 0. The others lead to a variation of field by » cycles 
along a circumferential line. These can be combined with the x = 0 solu- 
tion to give a solution for a developed helix of thin tape, for instance. Or, 
appropriate combinations of them can represent modes of helices wound of 
several parallel wires. For instance, we can imagine winding a balanced trans- 
mission line up helically. One of the modes of propagation will be that in 
which the current in one wire is 180° out of phase with the current in the 
other. This can be approximated by a combination of the m = +1 and 
n = —1 solutions. This mode should not be confused with a fast wave, a 
perturbation of a transverse electromagnetic wave, which can exist around 
an unshielded helix. 

Usually, we are interested in the slow wave on a singly-wound helix, and 
in this case we take n = 0 in (3.19), giving 


ysiny — Bocosy = 0 
tan y = Bo/¥ 


(3.20) 


8 
3 a yi (3.21) 


sny = 


cos (3.22) 


pie. ue 
"“@V+n" 


Let us evaluate the propagation constant in the axial direction. From Fig. 
3.8 we see that, in advancing unit distance in the axial direction, we pro- 
ceed a distance cos y in the z direction and sin y in the y direction. Hence, 
the phase constant 8 in the axial direction must be 


8 = Bosiny + y cosy (3.23) 
Using (3.18) and (3.19), we obtain 

B= (B+ 7)” (3.24) 

7 = (8 — Bo)” (3.25) 


These are just relations (3.2, 3.3). 
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The power flow along the axis is that crossing a circumferential circle, 
represented by lines a-b in Fig. 3.8. As the power flows in the y direction, 
this is the power associated with a distance 27a sin y in z direction. Also, 
the power flow in the + region will be equal to the power flow in the —x 
region. Hence, the power flow in the helix will be twice that in the region 
x= Otox = +%,2s = Otos = 2rasiny. 


P=2 


2xasiny 


/ (1)(E,H? — E,H?) dx dz (3.26) 
z= 


z==0 
This is easily integrated to give 
_ 2a sin PEo 


yee 


P 


The magnitude E of the axial component of field is 
E= E, cosy (3.28) 
Using (3.21), (3.22), (3.24) and (3.28) in connection with (3.27) we obtain 
(E2/8°P) = (y/8)*(B/Bo)(uc/2rya) (3.29) 
We have 
uc = p/Vue = VWu/e = 377 ohms 


Thus 
F?/p?P = (y B)‘(B Bo) (60 ya) (3.30) 


The longitudinal impedance is half this, and the transverse impedance is 
(8/y)? times the longitudinal impedance. 


3.3 EFFECT OF WIRE SIZE 


An actual helix of round wire, as used in traveling-wave tubes, will of 
course differ somewhat in properties from the helically conducting sheet 
for which the foregoing material applies. 

One might expect a small difference if there were many turns per wave- 
length, but actual tubes often have only a few turns per wavelength. For 
instance, a typical 4,000 mc tube has about 4.8 turns per wavelength, while 
a tube designed for 6 mm operation has 2.4 turns per wavelength. 

If the wire is made very small there will be much electric and magnetic 
energy very close to the wire, which is not associated with the desired field 
component (that which varies as exp(—j@z) in the z direction). If the wire 
is very large the internal diameter of the helix becomes considerably less 
than the mean diameter, and the space available for electron flow is reduced. 
As the field for the helically conducting sheet is greatest at the sheet, this 
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means that the maximum available field is reduced. Too, the impedance 
will depend on wire size. 

It thus seems desirable to compare in some manner an actual helix and the 
helically conducting sheet. It would be very difficult to solve the problem 
of an actual helix. However, we can make an approximate comparison by 
a method suggested by R. S. Julian. 

In doing this we will develop the helix of wires just as the helically con- 


Fig. 3.9—The wires of a developed helix with about two turns per wavelength (case I) 
and about four turns per wavelength (case IT). In the analysis used, the wires are not 


quite round. 


<---> --—> 


| 


Fig. 3.10—Voltages on a developed helically conducting sheet for two turns per wave- 
length. 


ducting sheet was developed, by slitting it along a helical line normal to the 
wires. We will then consider two special cases, one in which the wires of the 
developed helix are one half wavelength long and the other in which the 


wires are one quarter wavelength long. 

The waves propagated on the developed helix are transverse electromag- 
netic waves propagated in the direction of the wires, and the electric fields 
normal to the direction of propagation can be obtained from a solution of 
Laplace’s equation in two dimensions (as in (3.15)—(3.17)). 
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It is easy to make up two-dimensional solutions of Laplace’s equation 
with equipotentials or conductors of approximately circular form, as shown 
in Fig. 3.9. In case I, the conductors are alternately at potentials —-V,+V, 
—V, etc.; and in case II, the potentials are —V, 0, +V, 0, —V, 0, +V, 
etc. Far away in the x direction from such a series of conductors, the field 
will vary sinusoidally in the z direction and will vary in the same manner 
with x as in the developed helically conducting sheet. Hence, we can make 
the distant fields of the conductors of cases I and II of Fig. 3.9 equal to the 
distant fields of developed helically conducting sheets, and compare the 
E*/8?P and the impedance for the different systems. Case I would correspond 
to a helix of approximately two turns per wavelength and case II to four 
turns per wavelength. 


3.3a Two Turns per Wavelength 


Figure 3.10 is intended to illustrate the developed helically conducting 
sheet. The vertical lines indicate the direction of conduction. The dashed 
slanting lines are intersections of the original surface with planes normal to 
the axis. That is, on the original cylindrical surface they were circles about 


the surface, and they connect positions along the top and bottom which 
should be brought together in rolling up the flattened surface to reconsti- 
tute the helically conducting sheet. 

Waves propagate on the developed sheet of Fig. 3.10 vertically with the 
speed of light. The vertical dimension of the sheet is in this case taken as 
\/2, where is the free-space wavelength.’ The sine waves above and below 
Fig. 3.10 indicate voltages at the top and the bottom and are, of course, 
180° out of phase. As is necessary, the voltages at the ends of the dashed 
slanting lines, (really, the voltages at the same point before the sheet was 
slit) are equal. 

A wave sinusoidal at the bottom of the sheet, zero half way up and 180 
out of phase with the bottom at the top would constitute along any horizon- 
tal line a standing wave,not a traveling wave. Actually, this is only one com- 
ponent of the field. The other is a wave 90° out of phase in both the horizon- 
tal and vertical directions. Its maximum voltage is half-way up, and it is 
indicated by the dotted sine wave in Fig. 3.10. The voltage of this com- 
ponent is zero at top and bottom. It may be seen that these two compo- 
nents propagating upward together constitute a wave traveling to the right. 
The two components are orthogonal spatially, and the total power is twice 
the power of either component taken separately. 

Figure 3.11 indicates an array of wires obtained by developing an actual 

7 Section 3.3a is referred to as “two turns per wavelength.” This is not quite accurate; 


it is in error by the difference between the lengths of the vertical and the slanting lines in 
Fig. 3.10. 
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helix which has been slit along a helical line normal to the wire of which the 
helix is wound. The dashed slanting lines again connect points which were 
the same point before the helix was slit and developed. Again we assume a 
height of a half wavelength. Thus, if the polarities are maximum +, —, + 

— etc. as shown at the bottom, they will be maximum —, +, —, +, —, + 
etc. as shown at the top, and zero half-way up. In this case the field is a 
standing wave along any horizontal line, and no other component can be 
introduced to make it a traveling wave. Half of the field strength can be re- 
garded as constituting a component traveling to the right and half as a 
component traveling to the left. 
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Fig. 3.11—Voltages on a developed helix for two turns per wavelength. 


Je---nip----o 


The equipotentials used to represent the field about the wires of Fig. 3.9, 
Case I and Fig. 3.10 belong to the field 
V + jW = In tan (3 + jx) (3.31) 
Here V is potential and y is a stream function. There are negative equi- 
potentials about z = x = 0 and positive equipotentials about x = 0, s = 
+7/2. For an equipotential coinciding with the surface of a wire of z-diam- 
eter, 2 Zwire, d/p is thus 
d/p = = 3.32 
p= = (3.32) 
atx = 0,3 < 4/4 
V = Intanz (3.33) 
atz = 0 


V = Intanhx (3.34) 


Hence, for an equipotential on the wire with an z-diameter 2z, the x-diam- 
eter 2x can be obtained from (3.33) and (3.34) as 


2x = 2 tanh” tan z (3.35) 
Of course, the ratio of the x-diameter d; to the pitch is given by 
(3.36) 


where x is obtained from (3.35). 
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In Fig. 3.12, d,/d is plotted vs. d/p by means of (3.35) and (3.36). This 
shows that for wire diameters up to d/p = .5 (open space equal to wire diam- 
eter) the equipotentials representing the wire are very nearly round. 

The total electric flux from each wire is 27¢ and the potential of a wire of 
2-diameter 2z is V = —In tan z. Hence, the stored energy W, per unit length 
per wire, half the product of the charge and the voltage, is 


W, = —7e ln tanz (3.37) 
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Fig. 3.12—Ratio of the two diameters of the wire of a helix for two turns per wave 
length (see Fig. 3.9) vs. the ratio of one of the diameters to the pitch. 





The total distant field and the useful fie!\d component are given by ex- 
panding (3.31) in Fourier series and taking the fundamental component, 
giving 

V = —2 cos 2ze*” (3.38) 
The — sign applies for x > 0 and the + sign for x < 0. Half of this can be 
regarded as belonging to a field moving to the right and half to a field moving 
to the left. 

For a field equal to half that specified by (3.38), which might be part of 
the field of a developed helically conducting sheet, the stored energy W, 
per unit depth can be obtained by integrating (E; + Ez) €/2 from x = 
—x tox = + and fromz = —7/4 to +7/4, and it turns out to be 


W. = } we (3.39) 


If we add another field component similar to half of (3.38), but in quadra- 
ture with respect to z and /, we will have the traveling wave of a helically 
conducting sheet with the same distant traveling field component as given 
by (3.31). Hence, the ratio R of the stored energy for the developed sheet 
to the stored energy for the developed helix is 


| 


R= 2W2/W; = vasage 
In tan z 


(3.40) 
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R is the ratio of the stored energies, and hence of the power flows (since 
the waves both propagate with the speed of light) of a developed helically 
conducting sheet and a developed helix with the same distant traveling fun- 
damental field components. Hence, at a given distance (E?/6?P)'/* for the 
helix is R'/* times as great as for the helically conducting sheet. In Fig. 
3.13, R'8 is plotted vs. d/p. 
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Fig. 3.13—Ratio R'8 pf (42/6?P)'8 for a helix to the value for a helically conducting 
sheet for the distant field. 
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Fig. 3.14—Ratio R'/3 of (£7/s*P)"5 for a helix to the value for a helically conducting 
sheet, field at the inside diameter of the helix or sheet. 
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The maximum available field for the developed helically conducting sheet 
(equation (3.38)) is that for « = 0. The maximum available field for the 
developed helix (equation (3.31)) is that for an electron grazing the helix 
inner or outer diameter, that is, an electron at a value of x given by (3.35). 
The fundamental sinusoidal component of the field varies as exp(— 2x) 
for both the sheet and the helix, and hence there is a loss in E* by a factor 
exp(— 4x) because of this. We wish to. make a comparison on the basis of 
FE’ and power or energy. Hence, on basis of maximum available field squared 
we would obtain from (3.40) 

1 —ts 
ime ~ In tan z 4 nat) 
where x is obtained from (3.35). Figure 3.14 was obtained from (3.32), 
(3.35) and (3.41). 


” 





3.0 

















, 2 TURNS PER WAVELENGTH 


‘ 


— a. 





























——— .- . . r- 


oe — 
































, 
| 
05 06 OF O08 


d 
Fig. 3.15—The transverse impedance of helices with two and four turns per wavelength 
vs. the ratio of wire diameter to pitch. 


In a transmission line the characteristic impedance is given by 


Tee | 
K = "ae (3.42) 


Here L and C are the inductance and capacitance per unit length. This im- 
pedance should be identified with the transverse impedance of the helix. 
We also have for the velocity of propagation, which will be the velocity of 
light, c, 
a eS 
c= V/ LC = V ue (3.43) 
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From (3.42) and (3.43) we obtain 
Ki = Vue/C = Vu/ee/C) 
= 3776/C 
Now C is the charge Q divided by the voltage V. Hence 
K, = 377 €V/Q 
In this case we have 


i 337e In tan z 
2re 


Ke 


(3.46) 
K; = —60 In tan, 


To obtain the impedance of the corresponding helically conducting sheet 
we assume, following (3.30) 


K, = (7/8) (v/Bo) (30/ya) (3.47) 
and assuming a slow wave, let y = £8, so that 
K, = 30/Bya (3.48) 


If we are to have m turns per wavelength, and the speed of light in the 
direction of conduction, then we must have 


Boa = 1/n (3.49) 
whence 
K, = 30n (3.50) 


For n = 2 (two turns per wavelength), K = 60. In Fig. 3.15, the charac- 
teristic impedance K, as obtained from (3.46) divided by 60 (from (3.50)) 
is plotted vs. d/p. 


3.3b Four Turns per Wavelength 


In this case there are enough wires so that we can add a quadrature com- 
ponent as in Fig. 3.10 and thus produce a traveling wave rather than a stand- 
ing wave. Thus, we can make a more direct comparison between the de- 
veloped sheet and the developed helix. 

For the developed helix we have 
A 


, ll a z ‘r) een en 
V + jv = In tan (2 + jx) + mist) 
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If we transform this to new coordinates z;, x, about an origin at z = 0, 
x = 2/4 we obtain 


" (Gt fs + #0) - (5 it =) 
1 — tan (2 + jx) sin 2 (2 + jx) 
We can now adjust A to give a zero equipotential of diameter 22; about x = 
x = 0, 2, = 0 (2 = m/4) by letting 


(3.52) 


A = (sin 2s) In G+ a *) (3.53) 
1 — tan % 
If A is so chosen, there will be roughly circular equipotentials of z-diameter 
2z, about z = + 2/4, etc. There will also be roughly circular equipotentials 
of the same z-diameter about z = 0, +7/2, etc., of potential +V. That 
about z = 0 has a potential 


v «he Gites) ®_D (3.54) 
1 — tan 2/ cos 22; 


where A is taken from (3.53). 
The distance between centers of equipotentials is p = 7/4, so that the 
ratio of z-diameter of the equipotentials to pitch is 


d/p = 2z/(m/4) = 2/(x/8) (3.55) 


The x-diameter of the equipotential about z = 0 (and of those about z = 


es etc.) can be obtained as 2x by letting V have the value given by (3.54) 


and setting > = 0 in (3.51), giving 


‘ A 
V = Intanhx walvapcaney 
wore cosh 2x 


The ratio of this x-diameter to the pitch, d,/, is 
di/p = y/(w/8), 
x is obtained from (3.56). 


To obtain the x-diameter of the 0 potential electrodes we take the deriva- 
tive (3.52) with respect to 2, giving the gradient in the z direction 


aV +j Wy Me sec’ (s; + jx) _ sec (2) + jxy) 
02 dz, +tan(a+jx) 1 — tan (a + jx) 
_ 2A cos 2(z) + jx) 
sin 2(2: + jx) 


(3.58) 
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We then let z; = 0 and find the value of x, for which dV /dz, = 0. When a= 
0, (3.58) becomes 


(1 — tanh’ x) 
(1 + tanh? x) 





A = s'nh 2x, tanh 2x, (3.59) 


As A is given by (3.53), we can obtain x, from (3.57), and the ratio of the 
x-diameter d, to the pitch is 


d2/p = x;/(x/8) (3.60) 


Figure 3.16 shows d,/d and d2/d vs. d/p. 
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Fig. 3.16—Ratios of the wire diameters for the four turns per wavelength analysis. 





























The ratios R and the impedance are obtained merely by comparing the 
power flow for the developed sheet with a single sinusoidally distributed 
component with the power flow for case II for the same distant field. In a 
comparison with the helically conducting sheet, 2 = 2 is used in (3.50). The 
results are shown in Figs, 3.13, 3.14, 3.15. We see that on the basis of the 
largest available field, the best wire size is d/p = .19. 


3.4 TRANSMISSION LINE EQUATIONS AND HELICES 


It is of course possible at any frequency to construct a transmission line 
with a distributed shunt susceptance B per unit length and a distributed 
shunt reactance X per unit length and, by adjusting B and X to make the 
phase velocity and E*/6?P the same for the artificial line as for the helix. 
In simulating the helix with the line, B and X must be changed as frequency 
is changed. Indeed, it may be necessary to change B and X somewhat in 
simulating a helix with a forced wave on it, as, the wave forced by an elec- 
tron stream. Nevertheless, a qualitative insight into some problems can be 
obtained by use of this type of circuit analogue. 
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3.4a Effect of Dielectric on Helix Impedance Parameter 


One possible application of the transmission line equivalent is in estimating 
the lowering of the helix impedance parameter (£°/6°P)!’*. 

In the case of a transmission line of susceptance B and reactance X per 
unit length, we have for the phase constant 8 and the characteristic imped- 
ance K 


B= V/BX (3.61) 
K = VX/B (3.62) 


Now, suppose that B is increased by capacitive loading so that 8 has a 
larger value 84. Then we see that AK will have a value Ka 


Ka = (8/Ba)K (3.63) 


Where should A be measured? It is reasonable to take the field at the 
surface of the helix or the helically conducting sheet as the point at which 
the field should be evaluated. The field at the axis will, then, be changed 
by a different amount, for the field at the surface of the helix is Jo(ya) times 
the field at the axis. 

Suppose, then, we design a helix to have a phase constant 8 (a phase 
velocity w/8) and, in building it, find that the dielectric supports increase 
the phase constant to a value 64 giving a smaller phase velocity w/84. Sup- 
pose 8/8» is large, so that y is nearly equal to 8. How will we estimate the 
actual axial value of (E?/6?P)'/3? We make the following estimate: 


1/3 2/3 
ree Io(Ba) 2 pp2 \1/8 
(E’/p’ P)4” = (2) ( ) (E/p° P) (3.64 

B : Ba Iy(8aa) p 


Here the factor (8/8,4)'* is concerned with the reduction of impedance 
measured at the helix surface, and the other factor is concerned with the 
greater falling-off of the field toward the center of the helix because of the 
larger value of y (taken equal to 8 and 8; in the two cases). 

The writer does not know how good this estimate may be. 


3.4b Coupled Helices 

Another case in which the equivalent transmission line approach is par- 
ticularly useful is in considering the problem of concentric helices. Such 
configurations have been particularly suggested for producing slow trans- 
verse fields. They can be analyzed in terms of helically conducting cylinders 
or in terms of developed cylinders. A certain insight can be gained very 
quickly, however, by the approach indicated above. 

We will simulate the helices by two transmission lines of series impedances 
jX; and 7X2, of shunt admittances 7B; and 7B, coupled by series mutual 
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impedance and shunt mutual admittance jX1. and jBy.. If we consider a 
wave which varies as exp(—jIz) in the z direction we have 
TZ, — jB\Vi — jByV2 = 0 (3.65) 
TV; — jXih — 7Xvl2 = 0 (3.66) 
TI, — 7B2V_ — 7B2Vi = (3.67) 
[V2 — jXele — jXvI, = 0 (3.68) 
If we solve (3.65) and (3.67) for J; and /s and eliminate these, we obtain 


Ve _ —(0 + Xi Bi + Xi By) 
V; X, By + By Xp 


Yn _ —(I" + X2B 2 + X12 Bis) (3.70) 
Vo X2 By + Bi Xx x3 
Multiplying these together we obtain 
IT + (X, Bi + Xe Be + 2X42 By) 


a (X, Xo “me xi) (By By oe By) = 0 


(3.69) 


We can solve this for the two values of I? 
l= —$(X, By + Xe Be + 2X2 By) 
+ 4 [(X1 Bi — X» Bs)? + 4 (X, By + Xo Bo) (Xi2 By) (3.72) 
+ 4(X1 X> By? + By Bo X12%)}'” 


Each value of I represents a normal mode of propagation involving both 
transmission lines. The two square roots of each I? of course indicate waves 
going in the positive and negative directions. 

Suppose we substitute (3.72) into (3.69). We obtain 


—(X, B, — Xe Be) + [(X1 B, — Xe B»)° 
Ve _ + A(X Bi + Xs Bs)(Xi2 Bs) + 4(X1 Xe Ba + Bi Be Xw)l"" og 


73) 


Vi 2(X, By + By X12) 


We will be interested in cases in which X;B; is very nearly equal to X2By. 
Let 


Ar; = X,B, — Xd. (3.74) 


and in the parts of (3.73) where the difference of (3.74) does not occur use 


X, = Xo = X 
(3.75) 
B, = By B 
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Then, approximately 


V, _ —AMo + [(AT 0)’ + 4(X Bi + BXu2))” (3.76) 
V; 2(XBy + BXi) we 





Let us assume that AT? is very small and retains terms up to the first 
power of AT? 
V2 ATo 


7," = ae ee 


(3.77) 
mr, = — XB (3.78) 


V2 
V; 


Ar? /T? 


= &) ~ TRB + XalX) 


(3.79) 

Let us now interpret (3.79). This says that if AT is zero, that is, if X¥,B, = 
X2B, exactly, there will be two modes of transmission, a /ongitudinal mode 
in which V2/V; = +1 and a transverse mode in which V2/V; = —1. If 
we excite the transverse mode it will persist. However, if AT) # 0, there 
will be two modes, one for which V2 > Vj, and the other for which V2 < V;; 
in other words, as AT is increased, we approach a condition in which one 
mode is nearly propagated on one helix only and the other mode nearly 
propagated on the other helix only. Then if we drive the pair with a trans- 
verse field we will excite both modes, and they will travel with different 
speeds down the system. 

We see that to get a good transverse field we must make 

Ato 

nm << 2(Br2/B + NXi2/X) (3.80) 
In other words, the stronger the coupling (By, Xi.) the more the helices 
can afford to differ (perhaps accidentally) in propagation constant and the 
pair still give a distinct transverse wave. 

Thus, it seems desirable to couple the helices together as tightly as _pos- 
sible and especially to see that By. and Xj. have the same signs. 

Let us consider two concentric helices wound in opposite directions, as in 
Fig. 3.17. A positive voltage V, will put a positive charge on helix 1 while a 
positive voltage V2 will put a negative charge on helix 1. Thus, By»/B is 
negative. It is also clear that the positive current 7, will produce flux link- 
ing helix 1 in the opposite direction from the positive current J,, thus mak- 
ing Xy2/X negative. This makes it clear that to get a good transverse field 
between concentric helices, the helices should be wound in opposite direc- 
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tions. If the helices were wound in the same direction, the “transverse” 
and “longitudinal” modes would cease to be clearly transverse and longitu- 
dinal should the phase velocities of the two helices by accident differ a little. 
Further, even if the phase velocities were the same, the transverse and longi- 
tudinal modes would have almost the same phase velocity, which in itself 
may be undesirable. 

Field analyses of coupled helices confirm these general conclusions. 


+1) +V; 


Fig. 3.17—Currents and voltages of concentric helices. 


3.5 ABsout Loss in HELICES 


The loss of helices is not calculated in this book. Some matters concern- 
ing deliberately added loss will be considered, however. 

Loss is added to helices so that the backward loss of the tube (loss for a 
wave traveling from output to input) will be greater than the forward gain. 
If the forward gain is greater than the backward loss, the tube may oscillate 
if it is not terminated at each end in a good broad-band match. 

In some early tubes, loss was added by making the helix out of lossy wire, 
such as nichrome or even iron, which is much lossier at microwave frequen- 
cies because of its ferrornagnetism. Most substances are in many cases not 
lossy enough. Iron is very lossy, but its presence upsets magnetic focusing. 

When the helix is supported by a surrounding glass tube or by parallel 
ceramic or glass rods, loss may be added by spraying aquadag on the in- 
side or outside of the glass tube or on the supporting rods. This is advan- 
tageous in that the distribution of loss with distance can be controlled. 

It is obvious that for lossy material a finite distance from the helix there 
is a resistivity which gives maximum attenuation. A perfect conductor would 
introduce no dissipation and neither would a perfect insulator 

If lossy material is placed a little away from the helix, loss can be made 
greater at lower frequencies (at which the field of the helix extends out 
into the lossy material) than at higher frequencies (at which the fields of 
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the helix are crowded near the helix and do not give rise to much current in 
the lossy material. This construction may be useful in preventing high- 
frequency tubes from oscillating at low frequencies. 

Loss may be added by means of tubes or collars of lossy ceramic which fit 
around the helix. 
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APPENDIX I 
MISCELLANEOUS INFORMATION 


This appendix presents an assortment of material which may be useful 
to the reader. 


CONSTANTS 
Electronic charge-to-mass ratio: 
n = e/m = 1.759 X 10" Coulomb/kilogram 
Electronic charge: e = 1.602 X 10-" Coulomb 
Dielectric constant of vacuum: € = 8.854 & 107” Coulomb/meter 
Permitivity of vacuum: w = 1.257 X 10° Henry/meter 
Boltzman’s constant: k = 1.380 X 10°” Joule/degree 


Cross PRODUCTS 


(A* xX A"). 
(A’ x A” dy = 
(A’ x A"), = 
MAXWELL’S EQuaTions: RECTANGULAR COORDINATES 
OE JF : OH oH ; 
——-— = —jupH, —" — —! = jek, + Je 
Jou ay “a jweEz + 


Oy Oz 
OF, Ok, ‘oul oH, oH, 

— —- W —_ a 
ds On nee ae ax 


: dH 0H 
—jopH, ee en 
Ox oy sii Ox oy 


= Jwek, + J, 


JE. 0k, 


= jock, + J, 
MAXWELL’s EQUATIONS: AXIALLY SYMMETRICAL 
OE : OH 
— = — jwonH, . 
Oz ; Oz 


OH, 


= —(jweE, + J,) 


OEp Ok, 


oH, 
Oz Op O02 Op 


= — jouH, = jweE, + J, 


d : ; d = 
: (pE,) = —jwupH, hd (pH) = p(jweE, + J.) 
Op 0p 
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MISCELLANEOUS FORMULAE INVOLVING /,,(x) AND K,(x) 
; , are ; A : ; Pen #4 
1. Ta(Z) — Tigs(Z) = 7 I,(Z), K,i(Z) — Kyil(Z) = — 7 K,(Z) 


T-1(Z) + Tgs(Z) = 20(Z), Kus(Z) + Kiyi(Z) = — 2K;(Z) 
3. Z(Z) + vI(Z) = ZL1(Z),  ZK.(Z) + vK,(Z) = — ZK,1(Z) 
. ZU (Z) — vIAZ) = Zli4(Z), = ZK(Z) — vK(Z) = — Z2K4:(Z) 
(sy (Z°1,(Z)) = 2°-"I,-n(Z), (45) [Z°K,(Z)} 
= (—)"Z “K,.(Z) 
(x55) end _ Iah2) = ; [KAZ)\ _ ¢_ym Krim(Z) 
‘\Zaz) a fae? ZdZ) \ za | Zim 
.1(Z) = (Z),  —-Ko(Z) = —K,(Z) 
_1AZ) = 1AZ),  K-AZ) = KZ) 


1/2 
, ae Bid Ss 
. Ky2(Z) _ (3) € 


‘ 1,(Ze"™") is e””"™"'T,(Z) 


. St Nn Mvr 


11. K(Ze™") =e" KZ) — TZ) 


sin VT 
12. 1,(Z) Kiyi(Z) + D4i(Z) K(Z) = 1/Z 
For small values of XY: 
13. Io(X) = 1+ .25 X2 + .015625 X4-+ --- 
14. 1,(X) = 5X + .0625 X3 + .002604 X° + -- 


15. K(X) = = -\1 + In (3 )} Io(X) + ; i 


4 


. (Euler’s constant) 


16. A\(X) = — Uh (X) + 5 ni ae 
37 


For large values of X: 
x 


| 5. 0703125 . 073242 | 
(QeX)i2 | 


) 
17. Io(X) tea XY XY? Xa 5 


ee 
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x = o 
18. (xX) ~ {1 ee. 1171875 .102539 _ und 


Pie hy, eee wa 
(2rX)!? 


XY <a } 
1/2 Td - 
aw » wine E 125 , .0703125 .073242 
9, K(X) ~ > a ee > caihiatalgg gion vee 
19. Ko(X) (+) F \! o + a. + 


ee ee 375 1171875 , .102539 
0. Kix) ~() ds ses a. iti ies, ale }. 
Fig. A1.1 shows Jo(X) (solid line) and the first two terms of 13 and the 
first term of 17 (dashed lines). 
Fig. A1.2 shows J,(X) (solid line) and the first term of 14 and the first 
term of 18 (dashed lines). 


Fig. A1.3 shows Ko(X) (solid line) and {9 + In (3)} Io(X) and the 


first term of 19 (dashed lines). 


( 
Fig. A1.4 shows K,(X) (solid line) and \7 + In (*)}ncn + 1/X and 


the first term of 20 (dashed lines). 


200 









































5 6 7 
x 
Fig. Al.l—The correct value of Jo(X) (solid line), the first two terms of the series 
expansion 13 (dashed line from origin), and the first term of the asymptotic series 17 
(dashed line to right) 
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Fig. Al.2—The correct value of /,(X) (solid line), the first term of the series ex 
pansion 14 (lower dashed line), and the first term of the asymptotic series 18 (upper 
dashed line). 
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‘ 2 3 4 5 6 7 
x 


Fig. A1l.3—The correct value of Ko(X) (solid line), — {y + In (3)} Io(X) from the 


series expansion 15 (left dashed line), and the first term of the asymptotic series 19 
(right dashed line). 
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Fig. Al.4—The correct value of K,(X) (solid line), {y + In (3 )) 1,(X) from the 


series expansion 16 (upper dashed line), and the first term of the asymptotic series 20 
(lower dashed line). 
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APPENDIX II 


PROPAGATION ON A 
HELICALLY CONDUCTING CYLINDER 
The circuit parameter important in the operation of traveling-wave tubes 
is: 
(E,/8°P)'” (1) 
B = w/v. (2) 


Here E, is the peak electric field in the direction of propagation, P is the 
power flow along the helix, and » is the phase velocity of the wave. The 
quantity E;/8°P has the dimensions of impedance. 

While the problem of propagation along a helix has not been solved, what 
appears to be a very good approximation has been obtained by replacing 
the helix with a cylinder of the same mean radius a which is conducting 
only in a helical direction making an angle V with the circumference, and 
nonconducting in the helical direction normal to this. 

An appropriate solution of the wave equation in cylindrical co-ordinates 
for a plane wave having circular symmetry and propagating in the z direc- 
tion with velocity 


(3) 


less than the speed of light c, is 
E, = [Alo(yr) + BK (yr) 
where J) and Ko are the modified Bessel functions, and 
2 2 @ ’ 2 2 im 
v=6-(2) =6- 6 (5) 
The form of the z (longitudinal) components of an electromagnetic field 
varying as e’“** and remaining everywhere finite might therefore be 
Ha = Bil o(yr)e“ (6) 
Ex: = Bal o(yr)e (7) 


inside radius a, and 


Hy» = BoK (yr)e** -B2) 
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Eu i BiKo(yne“ (9) 


outside radius a. Omitting the factor e’““**” the radial and circumferential 


components associated with these, obtained by applying the curl equation, 
are, inside radius a, 


Hos = Bs" L(yr) (10) 


Ba = Be hw) (11) 
Y 
Eg. = 1 I,(yr) (12) 


Ey, Be hy) (13) 
and outside radius a 


= i Ki(yr) 
Y 
= —~Bie K,(yr) (15) 
Y 
= B, 224 K,(yr) (16) 
Y 


En Be K,(yr). (17) 


The boundary conditions which must be satisfied at the cylinder of radius 
a are that the tangential electric field must be perpendicular to the helix 
direction 


E.3 sin V + Eg; cos V (18) 

E. sin V + Ey. cos V j (19) 
the tangential electric field must be continuous across the cylinder 

Ex = Ex (and Ey, = Ego), (20) 


and the tangential component of magnetic field parallel to the helix direc- 
tion must be continuous across the cylinder, since there can be no current 
in the surface perpendicular to this direction. 


Ha sin Vv oo A; cos V = Ha sin V 
oa Hos cos Vv. 
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These equations serve to determine the ratios of the B’s and to determine 
y through 
To(ya) Ko(-ya) 2 
(ya)? ST = (Ba cot VW)’. (22) 
I\(ya)Ki(ya) : 

We can easily express the various field components listed in (6) through 
(17) in terms of a common amplitude factor. As such expressions are useful 
in understanding the nature of the field, it seems desirable to list them in 
an orderly fashion. 


INSIDE THE HELIX: 
E, BIo(yr)e? 


E, jg B Ii(yr)e 
¥ 


Es pit) 2 pty 


I(ya) cot ¥ 
_,ByT(ya) 1 i(wt—Be) 
E By Tuva) cot y 6” 
BB Idye) 1 
k Bo Ii(ya) cot p 


jeitot-) 


H, 


j(wt—Bz) 


H, I(yr)e 


.B Bo 7, i 
He a j I Bo I(yr)e" Be) 
k ¥ 


OUTSIDE THE HELIX: 
Io(ya) 


E, = BO Ko(yr)e™ 
Kava) *° yr ye 


; -p»B In(ya) ,, i(wt—B2) 
E, —jB= = Ki(yr)e” 
ty Koya) 


. Io( r j(wt—Bz 
Es oni o(ya) 1 Ky (yrei" Bz) 


K,(ya) cot p 
j ; 2, ae = y K (ane 
B To(ya) oe 
k Ki(ya) cot 
_ _, B Bo In(ya) 
k y Ixi(ya) 


H, 
H, Ki (yer 


Ky (yr) 


k= Vu/e = 120 x ohms 
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The power associated with the propagation is given by 
P=4Re [| EX H*dr 


taken over a plane normal to the axis of propagation. This is 


P = rRe | [ (E,H3 — Ee H*)r dr + / (zee Ey t?)r dr | 
0 a 


or 


poate (4 et) [ tema 
hy 
m (j) (1+ 
fg pone 
E30) 5, 72 [(1+) + i, K z) (Ti 


+(e) ( 


(E:/8°P)** = (B/Bo)\"(y/8)*F (ya) 


where k = 120 a ohms. 
Let us now write 


where 


, a (ya)" ibe: Io Ky 
F(ya) = {( a I Iz) (1 + nt) 
Io K?) Saag 
+(2) (Ko Ke, — Ky (1 +3 *)] : 


We can rewrite the expression for F(ya) by using relations, Appendix I: 


ya) = (1% fo} (4 :) (A - ©) A ie 
Pye) = (70 Z| (7 - L, + K, Ko ae ' (41) 





Communication in the Presence of Noise—Probability 
of Error for Two Encoding Schemes 


By S. O. RICE 


Recent work by C. E. Shannon and others has led to an expression for the 
maximum rate at which information can be transmitted in the presence of ran- 
dom noise. Here two encoding schemes are described in which the ideal rate is 
approached when the signal length is increased. Both schemes are based upon 
drawing random numbers from a normal universe, an idea suggested by 
Shannon’s observation that in an efficient encoding system the typical signal 
will resemble random noise. In choosing these schemes two requirements were 
kept in mind: (1) the ideal rate must be approached, and (2) the problem of 
computing the probability of error must be tractable. Although both schemes 
meet both requirements, considerable work has been required to put the expres 
sion for the probability of error into manageable form. 


1. INTRODUCTION 


In recent work concerning the theory of communication it has been 
shown that the maximum or ideal rate of signaling which may be achieved 
in the presence of noise is (1, 2, 3, 4, 5) 


R:=F loge (1 + Ws/Wy) bits/sec. (1-1) 


In this expression F is the width of the frequency band used for signaling 
(which we suppose to extend from 0 to F cps), Ws is the average signaling 
power and Wy the average power of the noise. The noise is assumed to be 
random and to have a constant power spectrum of Wy/F watts per cps 
over the frequency band (0, F). 

This ideal rate is achieved only by the most efficient encoding schemes 
in which, as Shannon (1, 2) states, the typical signal has many of the prop- 
erties of random noise. Here we shall study two different encoding schemes, 
both of them referring to a bandwidth F and a time interval 7. By making 
the product FT large enough the ideal rate of signaling may be approached 
in either case* and we are interested in the probability of error for rates 
of signaling a little below the rate (1-1). The work given here is closely 
associated with Section 7 of Shannon’s second paper (2). 

In the first encoding scheme the signal corresponding to a given message 
lasts exactly 7 seconds, but (because the signal is zero outside this assigned 
interval of duration) the power spectrum of the signal is not exactly zero 
for frequencies exceeding F. In the second encoding scheme, the signal 

* A recent analysis by M. J. E. Golay (Proc. I. R. E., Sept. 1949, p. 1031) indicates 


that the ideal rate of signaling may also be approached by quantized PPM under 
suitable conditions. 
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power spectrum is limited to the band (0, F) but the signal, regarded as a 
function of time, is not exactly zero outside its allotted interval of length 7. 

It turns out that both schemes lead to the same mathematical problem 
which may be stated as follows: Given two universes of random numbers 
both distributed normally about zero with standard deviations o and », 
respectively. Let the first universe be called the o (signal) universe And 
the second the v (noise) universe. Draw 2N + 1 numbers A“), ASQaa, e+, 
Ay”, -+», A” at random from the o universe. These 2V + 1 numbers 
may be regarded as the rectangular coordinates of a point Py in 2V + 1- 
dimensional space. Draw 2V + 1 numbers B_y,:::, By, --:, By at 
random from the vy universe and imagine a (hyper-) sphere S of radius xo 
= P,Q, where 


N 
m= 2 B= Pi, (1-2) 
n=—N 
centered on the point Q whose coordinates are A,’ + B,, n = —N, 
0,--+, V. Return to the o universe, draw out A sets of 2V + 1 numbers 
each, denote the kth set by AS, --- , Ag”? 
point by Px. 

What is the probability that none of the A points ?;, --- , Px lie within 
the sphere S$? In other words what is the probability, which will be denoted 
by “Prob. (PiQ, --- , PeQ > PoQ),” that the A distances P,Q, --- , PxQ 
will all exceed the radius PoQ? In terms of the A,’s and B,’s we ask for 
the probability that all K of the numbers x, x2, --- , %« exceed x» where 

N 
m= DY (A* — Am — B,) = PO (1-3) 


n=—N 


(k . 
,++:, Ay and the associated 


Expression (1-2) for xo is seen to be a special case of (1-3). The relationship 
between the points Py, Q, Pi, Ps, +++, Pe, +++, Px is indicated in Fig. 1. 

The answer to this problem is given by the rather complicated expression 
(4-12) which, when written out, involves Bessel functions of imaginary 
argument and of order N — 1/2. When N and K become very large the 
work of Section 5 shows that the probability in question is given by 


Prob. (PQ, toe PxQ > PQ) 


= (1+ erf H)/2 + 0(1/K) + O(N” log*” N) 


9 / 2? 
v4 < 


/@, 


1 + r 1/2 
= Ct ) G + 1/2) log. (1 + 1/r) — log, (K + 1) 
N 


where, with r = v 


(1-5) 
2eN(1 + 2r) 


+ ] J, 
ne i + 9p 


2 
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The symbol 0(.V-/? log®/? NV) stands for a term of order N~ log*”? N, i.e., 
a positive constant C and a value Ny can be found such that the absolute 
value of the term in question is less than CN~? log’? NV when N > No. 
In order to obtain actual numerical values for C and No, considerably 
more work than is given here would be required. The term 0(1/K) is of 
the same nature. The “order of” terms have been carried along in the work 
of Section 5 in order to guard against error in the many approximations 


which are made in the derivation of (1-4). 


© 1S THE ORIGIN OF COORDINATES 
ACK) AW)... A) OF POINT Py 
IN SPACE IN 2N +1 DIMENSIONS 








ce) 


Fig. 1—-Diagram indicating relationship between points Po, Q, and P, corresponding 
to signal, signal plus noise, and &* signal not sent (k > 0), respectively. 


The last term within the bracket in (1-5) has been retained even though 
it gives terms of order V~'? log N when (1-5) is put in (1-4) and could 
thus be included in OCV~'” log* V). As shown by the table in the next 
paragraph, inclusion of this term considerably improves the agreement 
between (1-4) and values of Prob. (P:Q,---, PxQ > PoQ) obtained by 
integrating the exact expression (4-12) numerically. This suggests that 
the term O(.V~'? log?/? V) in (1-4) is unnecessarily large. 

Although the “order of” terms in (1-4) give us some idea of the accuracy 
of the approximation expressed by (1-4) and (1-5), a better one is desirable. 
With this in mind the lengthy task of computing the exact expression (4-12) 
for Prob. (PQ, --- , PkQ > PoQ) by numerical integration was undertaken. 
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The values obtained in this way are listed in the second column of the 
following table. The values of Prob. (P:Q,---, P«Q > PoQ) obtained 
from (1-4) (in which the “order of” terms are ignored) and (1-5) are given 
in the third column. Column IV lists values obtained from (1-4) and a 
simplified form of (1-5) obtained by omitting the last term in (1-5). These 
values are less accurate than those in the third column. The values in 
Column V are computed from (1-5) and a modified form of (1-4) obtained 
by adding the correction term shown in equation (5-53) (with B = #H). 
The values in Column V are presumably the best that can be done with the 
approximations made in Section V of this paper, although the first entry 
renders this a little doubtful. 


Prob. (PQ, --- , PrO > PoQ) for N = 99.5 & r = 1 


Numerical 
K+1 Integration (1-4) & (1-5) Col. IV Col. V 


~~ .994 .9995 9987 1.0001 
sa, ta .962 .9650 .9337 .9710 
re .603 621 .5000 605 
ev .1196 1159 .0663 .1176 
— .0065 .00347 .0013 .00586 

It will become apparent later that the value AK + 1 = 2!” corresponds 
to the ideal rate of signaling. The non-integer value of 99.5 for N is ex- 
plained by the fact that the calculations were started before the present 
version of the theory was worked out. It will be noticed that for K + 1 = 
21%¢-8 all of the approximate values exceed the .994 obtained by numerical 
integration. I am in doubt as to whether the major part of the discrepancy 
is due to errors in numerical integration (due to the considerable difficulty 
encountered) or to errors in the approximations. 

In both encoding schemes, the point Py) corresponds to the transmitted 
signal, Q to the transmitted signal plus noise, and Pi, Ps, --- Px to K 
other possible signals. The average signal power turns out to be (V + 1/2)0° 
and the average noise power to be (.V + 1/2)»*. Furthermore, 


Xo = twice the average power in the noise. 


ie OMT ee eae ‘“* plus the &th signal. 


Prob. (P:Q, --: PxQ > PoQ) = Probability that none of the K other 
signals will be mistaken for the signal sent, 
i.e., the probability of no error. 


The random numbers Ao are taken to be distributed normally instead 
of some other way because this choice makes the encoding signals (in our 
two schemes) resemble random noise, a condition which seems to be neces- 
sary for efficient encoding (1, 2). 
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Both of the encoding schemes are concerned with sending, in an interval 
of duration 7, one of K + 1 different messages. According to communica- 
tion theory (1, 2, 3) this corresponds to sending at the rate of 7-! logs 
(AK + 1) bits per second. However, instead of discussing the rate of trans- 
mission, it is more convenient, from the standpoint of (1-4), to deal with 
the total number of bits of information sent in time 7. Thus, selecting and 
sending one of the K + 1 possible messages is equivalent to sending 


M = logo(K + 1) (1-6) 


bits of information. M, or one of the adjacent integers if M is not an integer, 
is the number of “‘yes or no” questions required to select the sent message 
from the A + 1 possible messages (divide the K + 1 messages into two 
equal, or nearly equal, groups; select the group containing the sent message 
by asking the person who knows, “‘Is the sent message in the first group?’’; 
proceed in this way until the last subgroup consists of only the sent mes- 
sage). The amount of information which would be sent in time 7 at the 
ideal rate R, defined by (1-1) is 


M, = TR, = FT loge (1 + 1/r)= (N + 1/2) loge (1 + 1/r) (1-7) 


where use has been made of Wy/Ws = v*/o? = r, and the relation VN < 
FT < N + 1 (which turns out to be common to both encoding schemes) 
has been approximated by NV + 1/2 = FT. 

When (1-6) and (1-7) are used to eliminate NV and K from (1-5) the 
result is an expression for the actual amount M of information sent (in 
time 7) in terms of (1) the amount M, which is sent by transmitting at 
the ideal rate (1-1) for a time 7, (2) the ratio r of the noise power to the 
signa! power, and (3) the probability of no error in sending M bits of in- 
formation in time 7, this probability being given as (1 + erf H)/2: 


M = M, — aM\"H + b (1-8) 


where 


| logs ¢ ic 
“L(+ 1r) log. A + 1/r)j ’ 


2 °°! (1 +r)? logs (1 + 1/7) 


Here the “order of” terms in (1-4) have been neglected together with 
similar terms which arise when V + 1/2 is used for NV in computing a and 
b. The term d is usually small compared to aM’?H. 

The more slowly we send, the less chance there is of error. The relation- 
ship between M, M, and the probability of no error, as computed from 


(1-9) 
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(1-8), is shown in the following table. The probability of no error is de- 
noted by p and the terms are given in the same order as on the right of 
(1-8) in order to show their relative importance. The ratio M/ M,(= R/R;) 
for r = 0.1 is shown as a function of M in Fig. 2. 


For r = Wy/Ws = 0.1 


M, bits M forp=. M for p = .99 M for p = .99999 


5 
10° M,—O+ 3.75 Mr—24.3+ 3.75 M,—44.6+ 3.75 
ae pF, “ — 243+ 7.07 — 46+ 7.07 
108 “+4 10.38 “ — 2430+ 10.38 4460 + 10.38 


Forr = Wy/Ws = 1 


2 M;—-0+ 4.44 M,— 33.4+ 4.44 1] 61.2 + 4.44 
10*  * in, 739 * — Be 9.96 612 + 7.7 
10° “ “4411.08 “ — 3340+ 11.08 6120 + 11.08 


There may be some question as to the accuracy of the values for p = .99999, 
especially for M; = 100, since this corresponds to points on the tail of 
the probability distribution where the “order of” terms in (1-4) become 
relatively important. 

Of course, for a given bandwidth, the ideal rate of signaling R, (given by 
(1-1)) for r = .1 exceeds that for r = 1 in the ratio (logs 11)/(logs 2) = 
3.46. 

The above results agree with the statement that, by efficient encoding, 
the rate of signaling R can be made to approach the ideal rate R; = M, T 
given by (1-1). As applied to our two schemes, the term “efficient encoding” 
means using a very large value of FT or .V. To see this, divide both sides of 
(1-8) by M;, and rearrange the terms: 


1— M/M, = aH MM,” + 0(M7' log M;) (1-10) 


When M, is replaced by R:T in M/M), the fraction M/T occurs. We shall 
set R = M/T and call R the rate of signaling corresponding to some fixed 
probability of error (which determines H). Thus, when (1-7) and the defini- 
tion (1-9) for a are used, (1-10) goes into 


(Ri — R) _ 2H 


—~—— = —_—— —— - + O((log FT)/FT) (1-11) 
R; (G+ OFT} log. + 1/7) * WOE EDED) Und 


Equation (1-11) shows that when r and H are fixed (i.e. when the noise 
power/signal power and the probability of error are fixed) R/R, approaches 
unity as FT — ~. This is shown in Fig. 2 for the case r = 0.1. Since R/ R; = 
M/M,, M/M, must approach unity and consequently M as well as M; in- 
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creases linearly with FT. Thus, for efficient encoding M is large and, from 
(1-6), so is K. 

It should be remembered that equation (1-8) has been established only 
for the two encoding schemes of this article. The question of how much 
faster M/T approaches R; for the more efficient encoding schemes mentioned 
at the end of Section 2 still remains unanswered. 


Wy /Ws = 0.1 
AS M—>o 
1-R/Ry~aHNM 


WITH “a” DEFINED 
BY EQUATION (1-9) 


} 
2 6 4 
105 106 
M=BITS OF INFORMATION IN MESSAGE 
Fig. 2—Curves showing the approach of R/R,; (= M/My,) to unity as the message 
length increases and the probability of no error remains fixed. R is the rate of signaling 
at which the probability of no error is p and Kk; is the ideal rate. 


It gives me pleasure to acknowledge the help I have received in the prepa- 
ration of this memorandum from conversations with Messrs. H. Nyquist, 
John Riordan, C. E. Shannon, and M. K. Zinn. I am also indebted to Miss 
M. Darville for computing the tables shown above and for checking a num- 
ber of the equations numerically. 


2. THE First ENCODING SCHEME 
Suppose that we have A + 1 different messages any one of which is to 
be transmitted over a uniform frequency band extending from zero to the 
nominal cut-off frequency F in a time interval of length 7. The adjective 
“nominal” is used because the sudden starting and stopping of the signals 
given by the first encoding scheme produces frequency components higher 
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than F. A shortcoming of this nature must be accepted since it is impos- 
sible to have a signal possessing both finite duration and finite bandwidth. 
The first step of the encoding process is to compute the integer .V given 

by 
N<FTF<N+1 (2-1) 


We assume that F/T is not an integer in order to avoid borderline cases. 
Let Ws be the average signal power available for transmission and define 
the standard deviation o of the o universe introduced in Section 1 by 
(N + 1/2)o? = Ws. To encode the first message, draw 2V + 1 numbers 
ASX, «++, Ag”’,-+» Ay” at random from theo universe. The signal correspond- 
ing to the first message is then taken to be 
N 
Io(t) = 27AS + DY (A™ cos 2ent/T + A sin 2ent/T) (2-2) 
n=] 
The remaining A messages are encoded in the same way, the signal repre- 
senting the kth message being 


N 
T(t) = 2°7AS? + DO (AW? cos 2eni/T + A“) sin 2ent/T). (2-3) 
n=l 

It is apparent that each signal consists of a d-c term plus terms corre- 
sponding to .V discrete frequencies, the highest being V7 < F, and that 
the average power (assuming /;(/) to flow through a unit resistance) in the 
kth signal is 

e T/2 N 
-* Ft) dt = F(A) + Hr ya)? + (4@)7} (2-4) 
_ 7/2 n=l 
Since the A’s were drawn from a universe of standard deviation o, the ex- 
pected value of the right hand side is (2V + 1)o*/2 which is equal to the 
average signal power Hs, as required. 

We pick one of the A + 1 messages at random and send the correspond- 
ing signal over a transmission system subject to noise. We choose our nota- 
tion so that the sent signal is represented by /o(/) as given by (2-2). Let the 
noise be given by 


N 
I(t) = 2"*By + DY (By cos 2ent/T + B_, sin 2ent/T) (2-5) 


n=] 


where B_y, ++: , By, +++ , By are (2N + 1) numbers drawn at random from 
the normally distributed v universe mentioned in the introduction. The 
standard deviation v of the universe is given by (V + 1/2)h? = Wy, Wy 
being the average noise power. We call J(/) simply “‘noise’”’ rather than 
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“random noise” to emphasize that (2-5) does not represent a random noise 
current unless V and T approach infinity. 

The input to the receiver is Jo(t) + J(¢). Let the process of reception 
consist of computing the K + 1 integrals 


7/2 
m = 27" [ (al) — Bi) — JOP dt, k=O, KK 26) 
L 7/2 

and selecting the smallest one (all of the K + 1 encodings have been carried 
to the receiver beforehand). If the value of k corresponding to the smallest 
integral happens to be 0, as it will be if the noise J(/) is small, no error is 
made. In any other case the receiver picks out the wrong message. 

When the representations (2-2), (2-3), and (2-5) are put in (2-6) and the 
integrations performed, it is found that 


N N 
m= >, (AP —-AM—B,), m= DB, (2-7) 
n=—N n=—N 

which have already appeared in equations (1-2) and (1-3). If, as in Section 
1, P, is interpreted as a point in 2.V + 1 — dimensional Euclidean space with 
coordinates A“Y, «+. , Ao”, +++ , Ay’ and Q is the point ASy + B_y, --:, 
Ay’ +B,,...,A¥’ + By, then x, isthe square of thedistance between points 
P, and Q. Point Py corresponds to the signal actually sent, points P;, «+> , 
Px to the remaining signals, and point Q to the signal plus noise at the 
receiver. The expected distance between the originand P, is ¢(2V + 1)!” 
= (2W’s)'*, that between Po and Q is o(2N + 1)'? = (2Wy)!'?, and that 
between the origin and Q is 


(o? + v*)'2(2N + 1)? = (QWy + 2W ss)!” 


No error is made when 4p is less than every one of x1, %2, ++: , Xx, Le., 
when none of the points: 1, --- , Px lies within the sphere S of radius xo 
centered on Q and passing through Po. Therefore the probability of obtain- 
ing no error when the first encoding scheme is used is equal to the probability 
denoted by Prob. (PQ, +--+, PxQ > PoQ) in the mathematical problem of 
Section 1. 


One might wonder why probability theory has played such a prominent 
part in the encoding scheme just described. It is used because we do not 
know the best method of encoding. In fact, it would not be used if we knew 
how to solve the following problem:* Arrange A + 1 points Po, --+ Px on 
the hyper-surface of the 2 + 1 — dimensional sphere of radius (2W’s)'? 


*C. E. Shannon has commented that although the solution of this problem leads to a 
good code, it may not be the best possible, i.e., it is not obvious that the code obtained 
in this way is the same as the one obtained by choosing a set of points so as to minimize 
the probability of error (calculated from the given set of points and some given Wy) 
averaged over all A + 1 points. 
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in such a way that the smallestof the K(K + 1)/2 distances P, Pz, k, f = 0, 1, 
- + +,K,k #1, has the largest possible value. This would maximize the dif- 
ference (as measured by the distance between their representative points) 
between the two (or more) most similar encoding signals.T 

In this paper we have been forced to rely onthe randomness of probability 
theory to secure a more or less uniform scattering of the points Py,:--, Px. 
In our work they do not lie exactly ona sphere of radius (2W’s)'/? but this 
causes us no trouble. 

3. THE SECOND ENCODING SCHEME 

The second of the two encoding schemes is suggested by one of Shannon’s 
(2) proofs of the fundamental result (1-1). In this scheme the A + 1 mes- 
sages are to be sent over a transmission system having a frequency band ex- 
tending from zero to F cycles per second, and are to be sent during a time 
interval of nominal length 7. 

The first few steps in the encoding process are just the same as in the first 
scheme. V is still given by (2-1) and o by (V + 1/2)o? = Ws. After drawing 
K + 1 sets of A’s, with 2V + 1 in each set, the K + 1 messages are 
encoded so that the signal corresponding to the &th message, k = 0, 1, «+>, 


K, is 


sin r(2Ft — n) 


id - "> 49S (3-1) 


n=—N a(2Ft — n) 


From (3-1), the value of J,(¢) at ¢ = n/(2F) is zero if the integer exceeds 
N in absolute value. If the integer ” is such that || < V, thecorresponding 
value of J,(t) is (FT)?A“. The energy in the &th signal is obtained by 
squaring both sides of (3-1) and integrating with respect to ¢. Thus 


o N 
| fina T*? > sal (3-2) 


n=—N 


when this amount of energy is expended during the nominal signal length 
T is (VN + 1/2)o? which is equal to Wg, as it should be. 
The noise introduced by the transmission system is taken to be 
sin r(2Ft — n) 


N 
Wt) = (FT)? faa =. ~ ati <4 
J(t) = (FT) 2. = (3-3) 


t Possibly if A + 1 discrete unit charges of electricity were allowed to move freely 
on the sphere, their mutual repulsion would separate them in the required manner. In 
2N + 1 dimensions this leads to the problem of minimizing the mutual potential energy 


43(P,P») 


where NV > 1 and the summation extends over k, ¢ = 0, 1, K with k # ¢. However, 
this problem also appears to be difficult. 


—2N+1 
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where the v universe from which the B’s are drawn has, as before, standard 
deviation v given by (V + 1/2)r? = Wy. When the signal /)(¢) is sent, the 
input to the receiver is /»(/) + J(¢) and the process of reception consists of 
selecting the smallest of the K + 1 a,’s 


x, = 27 / (T.(t) —- Io(t) — J(a)]’ dt 


N 


(kh) (o) 2 
= Pa (4 — Aw — B,) 

The second expression for x, is the same as the one given by (2-7) for the 
first encoding scheme, and the discussion in Section 2 following (2-7) may 
also be applied to the second encoding scheme. In particular, the probability 
of obtaining no error in transmitting a signal through noise is the same in 
both systems of encoding, and is given by the Prob. (PiQ, --- , PxQ > PoQ) 
of the mathematical problem of Section 1. 


4. SOLUTION OF THE MATHEMATICAL PROBLEM 


We shall simplify the work of solving the mathematical problem stated 
in Section 1 by taking o = 1 and y*/o* = r. First regard the 4.V + 2 numbers 
A,,”’, B,,n = —N, +++, N as fixed or given beforehand. Geometrically, this 
corresponds to having the points Py) and Q given. Select a typical set of 

. (k . y . ° 
random variables A;,’,n = —N,---,N,k > 0 and consider the associated 
set of variables 

(k) (0) (k) 2 
SS epee A - Fee A yo B,, = A, + Vn- 
yn is a random variable distributed normally about its average value 
i (0) 
Vn i —A n ee B,, 


with standard deviation o = 1. The quantity x;, defined by (1-3) and repre- 
senting the square of the distance between P, and Q, may be written as 


N 
m= > yi, (4-3) 


n=—N 


Thus x; is the sum of the squares of 2V + 1 independent and normally 
distributed variates, having the same standard deviation but different 
average values. The probability density of such a sum is remarkable in 
that it does not depend upon the J,’s individually but only on the smu of their 
squares which we denote by 


n n 


u > T, _ - (A + B,)° 


n=—N n=—N 


- 277 Energy in sent signal + Energy 
in noise 
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This behavior follows from the fact that the probability density of P, has 
spherical symmetry about the origin (because all the 4‘’’s have the same 
a). For the probability that x; is less than some given value x is the prob- 
ability that’, lies within a sphere of radius x'/? centered on Q, and this, 
because of the symmetry, depends only on x and the distance u'? of Q from 
the origin. Accordingly, we write p(x, w)dx for the probability that 
x <a < x + dx when the vy ’s (and hence w) are fixed. 

The probability density p(«, «) may be obtained from its characteristic 
function: 


2) 
p(x, u) = (2r) ‘| e "lave. e'”| dz 
—o 


y 
izr . 2 
ave. é€ = ave. exp i ph vs | 


nm—N 


N 
= || ave. exp [izy,] = (1 — 2is)-*~"” exp [iuz(1 — 2is)~'] 


n=—N 


where we have used (4-3) and, since y, is distributed normally about ,, 


oe 
ave. exp [izy,,| = (2r)7"” a elite 
a 


(1 — 2is)~"” exp [85 is(1 — 2iz)™* 
PL: 


Hence 
oo 
p(x, u) =(20)" (1 — 2iz)*~"” exp [isu(1 — 2is)”' — ica] ds 


a(x OO Ty —172[(ux)' *| er +x)/2 


where it is to be understood that x is never negative. The Bessel function 
of imaginary argument appears when we change the variable of integra- 
tion from z to / by means of 1 — 2iz = 2¢/x, and bend the path of integra- 
tion to the left in the ¢ plane (6). This expression for the probability density 
of the sum of the squares of a number of normal variates having the same 
standard deviation but different averages has been given by R. A. Fisher 
(7). 

We are now in a position to solve the following problem which is somewhat 
simpler than the one stated in Section 1: Given the 2.V + 1 coordinates 
A of the point Py and the 2V + 1 numbers B, so that the coordinates 
A + B,, of the point Q are given. What is the probability that none of the 
K points Pi, Ps, --- , Px, whose coordinates A‘? are drawn at random from 
a universe distributed normally about zero with standard deviation ¢ = 1, 
be inside the sphere centered on the given point Q and passing through the 
other given point Po? In other words, what is the probability that all A of the 
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independent random variables x1, x2, --- , xx will exceed the given value 
xo When u has the value defined by (4-4) together with the given values of 
the A;,’’s and B,’s? The variables x1, x2, ---, %« have the probability 
density p(x, «) shown in (4-6) and 4» is defined by (1-2) and the given values 
of the B,,’s. 

The answer to the above problem follows at once when we note that the 
probability of any one of x, ++: , Xx, say x, for example, being less than 


Xo 1S 


P(%,u) = [ p(x, u) dx. (4-7) 
0 


The probability of x2; exceeding x is then 1 — P(x, «) and the probability 
of all A of x1, +++ , Xn exceeding xp is 


[1 — P(x, u)]* (4-8) 


Instead of being assigned quantities, x» and u are actually random varia- 
bles when we consider the problem of Section 1. Now we take up the problem 
of finding the probability density of « when x» is fixed. Thus, from (4-4), 
we wish to find the probability density of 


N 
u= D0 (An + B,)” (4-9) 
n=—N 
in which the 2.V + 1 numbers A ag are drawn at random from a universe 
distributed normally about zero with standard deviation o = 1 and the 
numbers B_y, ++: , Bo, +--+, By are given. It is seen that w is the sum of 
the squares of 2V + 1 normal variates all having the standard deviation 
o = 1. The nth variate, A)’ + B,, has the average value B,. This is just 
the problem which was encountered at the beginning of this section. Equa- 
tion (4-9) is of the same form as (4-3) and we have the following correspond- 
ence: 
Equation (4-3) Equation (4-9) 
u 
An + By 
B, 
v xX = > B 
The probability that « lies in the interval u, « + du when 4 is given is there- 
fore p(u, xo) du where p(u, xo) is obtained by putting u for x and xo for u 
in the probability density p(x, 1). 
Until now x» has been fixed. At this stage we regard B_y, +--+ , Bo, +++, By 
as random variables drawn from a normal universe of average zero and 
standard deviation vy = or! = r'/*, If the standard deviation were unity, 
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the probability density of x) could be obtained directly from p(x, u) by 
letting « — 0 in (4-6). As it is, the x’s appearing in the resulting expression 
must be divided by r to obtain the correct expression. Thus, the probability 
of finding x) between x and xy) + dx is 


[x»/2r}"~"* 


——+€ 4-10 
2rt(N + 1/2) ‘ (4-10) 


pol x) = 


which is of the x* type frequently encountered in statistical theory. 
It follows that the probability of finding u in (u, u + du) and x» in 
(xo, Xo + dxo) at the same time is po(u, xo) du dxo where 


Po(u, Xo) = plu, Xo) Po(Xo) 


1 Ux) \* 2-4 Che ee (411) 
is 4rC(V + 1/2) 4r’ Ty 1/2 [ (Xo 0) le ] 


The replacement of (x, «) in (4-6) by (au, x») should be noted. 

Now that we have the probability density of « and x» we may combine it 
with the probability (4-8) that all A of x, --- , x« exceed x» when x» and x 
are fixed. The result is the answer to the problem stated in Section 1: 


Prob. (PQ, natn PxQ> PQ) 


a@ 
| 


This result is more complicated than it seems, for Po(u, x) is given by (4-11) 
and P(x», “) is obtained by integrating p(x, u) of (4-6) from x = Oto x = xp 


(4-12) 


2 
du | dx po(u, %o)[1 — P(x, u)|* 
0 


in accordance with (4-7). The remaining portion of the paper is concerned 
with obtaining an approximation to (4-12) which holds when .V and K are 
very large numbers. 


5. BEHAVIOR OF Pros. (27:0, «+: , PxQ > PoQ) As N and K Become LarGE 


In this section we introduce a number of approximations which lead to a 
manageable expression for Prob. (P,Q, --- , PxQ > PoQ) when N and K 
become large. 

Since « and x» are sums of independent random variables, namely 


N 


a= D> (Am + B.) 


n=—N 


N 


Re > 


n=—N 


the central limit theorem tells us that the probability density po(u, x») ap- 
proaches a two-dimensional normal distribution centered on the average 
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N 
a= >> ave.[A?” + Bi] = (2N +1)(1 +7) 
n=—N 
N (5-2) 
fo = >. ave. B, = (2N + 1)r 
n=—N 
Here we keep the convention ¢ = 1, v*/o” = r used in Section 4. The same 
sort of reasoning as used to establish (5-2) shows that the spread about these 
average values is given by 
ave. (u — a)? = (4N + 2)(1 + r)? 
ave. (%) — %)? = (4N + 2)r° (5-3) 
ave. (u — w)(xo — *) = (4N + 2)? 


If the parameters .V, A, and r in the integral (4-12) are such that its value 
is appreciably different from zero, most of the contribution comes from the 
region around # and x, where po(u, x9) is appreciably different from zero. 
However, instead of taking # and # as reference values, we take the nearby 
values 

U2 
(5-4) 
T= j ] 2gr 
as these turn out to be better representatives of the center of the distribu- 
tion. We have introduced the number 
g= N — 1/2 (5-5) 
in order to simplify the writing of later equations. We assume g > 1. 

First, we shall show that 
Prob. (PQ, ee PxQ > PQ) 

(5-6) 


a | ; du | - dxo po(u, xo){1 — P(xo, u)\* + R, 


49g—a 


where a = 2(1 + r)(2q log g)'*, b = 2r(2q log g)' and R;, is of order 1/g 
(denoted by O(1/q)), ie. a constant C and a value gy can be found such that 
| Ri | < C/q when q > qo. From (4-12) it is seen that R; is positive and less 
than 


ug—a bs) @ 
i du + in| [ Axo Pol, Xo) 
0 uet+a 0 
} ve dx ob [ ive] f dupo(u, Xo) 
0 “29 +h 0 


Le 





COMMUNICATION IN PRESENCE OF NOISE 


Since po(#, xo) is the joint probability density of « and x, the integration 
with respect to x» in the first part of (5-7) yields the probability density of 
u, and the integration with respect to u in the second part gives the prob- 
ability density po(x») (stated in (4-10)) of a». Thus (8) 


‘ lu 24 + r)l?e 2(1+r) 
1X9 Po(u, Xo) = 
I AXo Pou, x CET 


—Zo/2r 


[ hitsiiis te) [xo /2r]* e 
aUPoru, Xo) = = - 
0 2rl(q + 1) 
Setting (5-8) in (5-7) and putting « = 2(1 + r)y and x» = 2ry in the two 
parts of (5-7) reduces them to the same form. Thus (5-7) is equal to 


? atl 
2-= vie “dy (5-9) 
P(g + 1) I ' oa , 


with f = (2g log q)'/*. In order to show that (5-9) is O(1/¢) we use the ex- 
pansion 


—yvy+qlog y = —qg+ q log g — (y — g)*/(2g) + (y — g)*/(3¢*) 
—(y- q)‘glq aE ise q)6] 4 


where 0 < 6 < 1. Let v represent the sum of the (y — gq)’ and (y — g)* terms, 
and expand exp 2 as 1 + v plus a remainder term. The integral of exp — 
(y — q)*/(2qg), taken between the limits g + f, can be shown to be of the 
form 1 — O(1/g) by integrating by parts as in obtaining the asymptotic 
expansion for the error function. The term in (y — g)* vanishes upon integra- 
tion and the remainder terms may be shown to be of O(1/q). In all of this 


work a square root of g comes in through the fact that 
1 > (29q)!*g*e~*/T'(qg + 1) > exp [—1/(12g)] (5-10) 


We have just shown that the error introduced by restricting the region of 
integration as indicated by (5-6) introduces an error of order 1g which 
vanishes as g — *. The normal law approximation to po(u, x9) predicted by 
the central limit theorem holds over this restricted region. However, instead 
of appealing to the central limit theorem to determine the accuracy of the 
approximation, we prefer to deal directly with the functions involved. 

Consideration of (5-4) and the behavior of po{u, x») suggests the substitu- 


tion 
xo = 2r(q + a) 


u = 21+ 7r)(7 + 8B) 
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where a and 8 are new variables whose absolute values never exceed 
(2 g log g)'* in the restricted region of integration of (5-6). From (4-11) 


~ \a/2 ui baa 
Pol u,” x») du dx = - A (3,) I,(s' 2) Ot GrtetM da dg (5-12) 
q rr 


in which 
z= ux) = 4r(1 + r)(¢ + a)(q + 8) (5-13) 
In Appendix II it is shown that 


gr” e* 34” exp [(q° +s)? + V] 


yg re (5-14 
Ig + 1)(q? + 2)'4[g + (@ + 2)! 2]¢ 5-14) 


where Vo < 1/(2g + 1) when g > 1. Upon using (5-10) and (5-14) the right 
hand side of (5-12) may be written as 
da dB(2r) "(1 + r)(2r) “(q? + 2) exp [—(1 + r)(2g + a+ 8) (5-15) 
+ f(s) — log (gy + 1) + 0(1/9)] 
with 
f(z) = q log z — q log [g + (q+ 2)'"] + (g? + 2)!” (5-16) 


The value 2 of s corresponding to the central poles (u2, X2) Of po(u, xX) is 
obtained by putting a = 6 = 0 in (5-13): 


4r(1 + r)¢? 
Z— fe 4r(1 + r){ga + B) + af}. 


(5-17) 


Since we are interested in the form of po(#, x9) in the restricted region of 
integration of (5-6) we expand f(z) about s = 2 in a Taylor’s series plus a 
remainder term. 


f(s) = qlog 2rqg + g(1 + 2r) + (3 — ae’ 


‘eo 52)" q)3(3és = (5-18) 
32r'g(1 + Or) . 8 23 24 
6< 1 
> 


In the last term 23 = 22 + (3 — 22)0,0 < 
obtaining this expansion is simplified " (y? + s)'* is replaced by & in (5-16) 
before differentiating. For example, by using 2¢’ = 1, it can be shown that 
f’(s) is simply (q¢ + &)/ (2s). When the extreme values of a and 8 are put in 
(5-17), it is seen that s — 22 does not exceed 0(g°” log’? qg) in the restricted 
region of integration. In the last term of (5-18) 3 is O(g*), &; is O(g) and con- 
sequently the last term itself is O(q~!/? log*/? q). 


= = g°+ 23. The work of 
jl 


9° 
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When the expression (5-17) for (s — 32) is put in (5-18) an expression for 
f(z) is obtained. This expression, together with 
log '(g + 1) = (¢ + 1/2) log g — ¢ + (1/2) log 2x + O(1/q), 
enables us to write the argument of the exponential function in (5-15) as 
qg log 2r — (1/2) log 2rq — Q(a, 8) + O(g-'? log’? g) where O(a, 8) denotes 
the quadratic function 
O(a, B) = [1+ r)*(a? + 8) — 2r(1 + r)agl|D 
D = 1/(2qg(1 + 2r)} 
Similar considerations show that 
(g + 3) = gl + Ar) 1 + O(g-? log!? g)] (5-20) 
When the above results are gathered together it is found that (5-12) 
may be written as 
po(u, Xo) du dxo = D, exp [—Q(a, 8B) + O(g-!” log®* g)] da dp (5-21) 


where 


es es 
_ 2mrq(1 + 2r) 


Expression (5-21) is valid as long as ja and 2, do not exceed 
I iw] 
(2g log g)*”. 


Expression (5-21) differs from the one predicted by the central limit 
theorem (and (5-2) and (5-3)) in that it is not quite centered on the average 
values i, #, which correspond to a = 1, 8 = 1, respectively. Also, g enters 
in place of g + 1. However, these differences amount to O(q'? log! ? g) 
at most, as may be seen by putting a — 1 and 6 — 1 for a and 8 in (5-19). 

By using relations (5-6) and (5-21), it may be shown that 


Prob. (P10, ++: , PeQ > PQ) 


(5-23) 


da dB Die * weeag | — P(x, u)|* a Og” . log’ : q) 
—4 —¢ 


where it is understood that x» and w in P(x, u) depend on @ and 8 through 
(5-11). The term O(g~!? log*”? g) in (5-23) represents the sum of three con- 
tributions. The first is R; in (5-6) which is O(1/qg). The second arises from 
the fact that when the factor exp [O(qg~! log?’ g)] in (5-21) is neglected in 
integrating (5-21) over —f£ < a < f,-€ < 8B < ¢ where f = 
(2g log q)/?, the resulting integral is in error by 0(g~'? log*’? g). The third 
is due to the contributions of the integral from the region' a > ¢,' 8! > ¢. 
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By introducing polar coordinates a = p cos 6, 8 = p sin @ it can be shown 
that the region p > ¢ more than covers the region in question and that 


O(a, 8) 2 (1+ r)p’D (5-24) 


Upon integrating with respect to p and setting in the lower limit , it is 
seen that the third contribution is O(g~”). 
We now assume K to be large. Since 0 < P(x, w) < 1 we have 


0< e*?—(1— P)* < KP *" <1/K (5-25) 


The last inequality follows from x? exp (—x) < 1 for x 2 0. A proof of the 
remaining portions will be found in **Modern Analysis” by Whittaker and 
Watson, Cambridge University Press, Fourth Edition (1927), page 242. 
When we observe that replacing [1 — P(2», u)]* by 1/K in the right hand 
side of (5-23) gives an integral whose value is less than 1/A, we see that 


Prob. (PQ, ee S% PxQ > PQ) (5-26) 


‘a da [ dB Dye MP -KPF0™ +. Q1/K) + 0(g7"” log?” g) 
—=@ 7 


We now take up the problem of expressing the cumulative probability 
density P(x», «) in terms of a and 8. When xo and « lie in the restricted re- 
gion of integration shown in (5-6) they are near their average values 7) = 
(2NV + 1)rand # = (2V + 1)(1 +r). On the other hand the average value 
# of x and the mean square value o; of (x — #)? as computed from (4-6), or 
directly, are 2V + 1+ wand 4V + 2 + 4u, respectively. Thus we see that 
F — xy is of the same magnitude as 4.V and becomes much larger than oz as 
\ — x. The asymptotic development of Appendix I may therefore be used. 
In Appendix J (equations (A1-27) and (A1-29)) it is shown that when 
M(= 2m = 2N + 1) isa large number and 1 << (& — 4X)/o, 


P(xo, u) = (4amb2)? (1 + O(1/m)) exp [mF (21)] (5-27) 


where we have introduced the number m = VN +- 1/2 = g + 1 to save writ- 
ing .V + 1,2 or g + 1 repeatedly and where 
2b. = (1 — 1/2;)°(1 + 4st)!" 
[1 + (1 + 4st)!/*]/2s 
(1 + 4st)'? — s— 14 — log x 
= 2ms = (2N + 1)s, u = Imi = (2N + I) 
Comparison of the last line in (5-28) with (5-11) shows that ms and mt 
are equal to r(y + a) = r(m+ a — 1) and 


(1+r)(¢+ 6) = (1+)r)(m+ 8-1), 





COMMUNICATION IN PRESENCE OF NOISE 


respectively. It is convenient to introduce the notation 
y=a-1l, 6=B-—1 
' (5-29) 
s = r(1 + y/m), t= (1+ r)(1 + 5/m). 


It is seen that for the restricted region in which | a! and | B| are less than 
f = (2q log qg)'”, | y| and |6| are at most 
O(q'/? log!? g) = O(m!? log'/? m). 
Hence s, /, (1 + 4st)'*, 2 differ at most from r, 1 + 7, 1 + 27, 1 + 1/r, 


respectively, by terms of order m™'/? log'/? m. Similar considerations show 
that 


9 


(4amb2)"'? = (2rq)!?D [1 + O(m-!? log! m)] (5-30) 


The argument of the exponential function in (5-27) must be expanded in 
powers of y and 6. It turns out that when ¥ and 6 lie in the restricted region, 
powers above the second may be neglected. For the sake of convenience we 
rewrite (5-13) and introduce 3;: 

3 = xXou = 4m*st = 4r(1 + 1) (m+ y)(m + 8) 
2) = 4r(1 + r)m? 
3 — 3, = 4r(1 + r)[m(y + 5) + yd] 
so that 3 -- 3; is O(m'” log'/? m). Then 


(1 + 4st)'/? = (1 + 3/m’*)'” 


= (1 + 3;/m*)'? + (s — 3,)(1 + 2;/m?)~/2/(2m?) 
- (3g — 3,)°(1 -F 33/m?*)-3/*/ (8m) + Re 


where R, is of the same order as (2 — 3;)°/m’, or m-*” Jog’/? m. It follows 
that 


(1 ‘ 4st)’ , | + Or + 2r(l ot r) ¥Y 
als bo 1+ 2r m 
ar (1 +r) (y + 6)° ——— 

o 4 ons + O(m “~ log” m) 


(1 + 2r)* me 


(1+ 1) r 
= — 41 
r(1+ y/m) | + rte 


“(4 r)\ 5. 5)” 3/9 3/9 ) 
a TS ik 3/2 Iog*!? mm). 
m*(1 + 2r)? ) 
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Combining these and a similar expression for log 7 leads to 
mF (v1) = — m log (1 + 1/r) + 7 — 6 
—[(1 + r)y — rbP/[2m(1 + 2r)| + O(m-? log?/? m) 
= —(q + 1) log (1+ 1/r) +a — 8 — [(1+ ra — rBPD 
+ Oy? log”? 9) 
Substitution of (5-30) and (5-34) in (5-27) gives the result we seek: 


P(xo, u) = (1 + 1/r)-*"(29g)!?D, 


(5-34) 


3 (5-35) 
exp (a2 — B — [(1 + nha — rBPD + O(q-'” log?” q)) 
Since P(x, “) occurs only in the product A P(x, «) in (5-26) we set, in 
view of (5-35), 


K P(x, u) = AX(a, 8) exp S(a, 8) (5-36) 


where A(a, 8) stands for the terms denoted by exp [O(q~"? log*/? g)] in (5-35) 
and 
A = K(1 + 1/r)-*"(22q)'?D, 
(5-37) 
S(a, 8) = a— B— [1+ ra — rgPD 


“ 


As long as|a| < fand 8} < ¢, Xa, 8) is nearly unity and we write 


\y < Na, B) a, de 
(5-38) 
A = 1—6€,2. = 1+ 6€,€ = Cy” log’? gq 
where C is a positive constant large enough to make e dominate the terms 
of order g~!” log*? g in (5-35). g is supposed to be so large that € is very small 
in comparison with unity. 
Setting (5-36) in (5-26) gives 


Prob. (PiQ, --+, PeQ > PoR) = 1+ O1/K) + O(g"? log®? g) (5-39) 


where the contribution of the region outside a < f,|8| < ¢ has been 
returned to the terms denoted by O(g~!” log®? q) (we could have stayed in 
the region a! < f, 8) < ¢ from (5-23) onward, but didn’t do so because 
we wanted to show that the results coming from (5-25) were not restricted 
to this region) and 


4 ( 
] [ ,4< [23 D, exp | — O(a, 8) — AX(a, Beso] (5-40) 


Let L(A) denote the integral obtained by replacing the function A(q@, 8) in 
I by the positive constant A (which we shall take to be either \; or Ae defined 
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by (5-38)). Then, since A exp S(a, 8) is positive, it follows from (5-40) that 
L(y) > I > Lr2) (5-41) 


Also since exp [— AA exp S(a, 8)] lies between 0 and 1 for all real values of 


! 


a and £ it may be shown from (5-24) that (A) is equal to J(A) + O(g-"”) 
where 


J(rA) = | da | dB D,; exp [— la, B) — Ahe* ne (5-4 


Here X is a constant and O(a, 8), A, S(a, 8) are defined by (5-19) and 
From (5-39) and (5-41) we obtain 


Prob. (PQ, ae PxQ m PQ) = J T Al 7 (A}) 
+ (1 — @[J(A2) — J(1)] + 011/K) + O(g-!? log” q) 
where 0 < @ < 1. It will be shown later that /(A,) and J(X2) differ from J(1 
by terms which are certainly not larger than O(g~"”). 

The problem now is to evaluate the integral (5-42) for J(A). It turns out 
that exp [— AA exp S(a, 8)} acts somewhat like a discontinuous factor which 
is unity when S(a, 8) + log AX is negative and zero when it is positive. In 
order to investigate this behavior we make the change of variable 


a—-B=w a=Vv-—rTrw 
AI+ra—-rniB=y 
da dB = dw dy 
From (5-19), (5-37), and (5-42) 
Q(a, 8) = [y? + (1 + 2r)B|D = YD + B?/2¢q 
S(a, 8) = w — yD 


J(A) = | dy | dw D, exp {— yD — 8° /2q — Ade" 


Here and in the following work @ is to be regarded as a function of w and y. 
Split the interval of integration with respect to w into the two subintervals 
(— %, wo) and (wo, «) where 


wy = yD — log Ad (5-46) 


and y is temporarily regarded as constant. In the first interval 


wo 
| exp [— 6°/2q — e” “°| dw 
oe) 


ewe 32/2 [" 0 
= | gh in — | (i- 
= a 


Lx 
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Splitting the interval of integration (— *, we) into (—«, — log AX) and 
(— log AX, wo) in the first integral on the right of (5-47) shows that its con- 
tribution to J(A) is 


D | dy | 


oo 


log AX rand aw 


dwe Va 4 p, | dy | dw i (5.48) 


log AX 
Integrating with respect to y, after inverting the order of integration, shows 
that the value of the first integral is 


ab 
gl? oun 1 + et Bb) (5-49) 


where, from (5-37) and the definition (5-22) of D,, 


B= —i(14 r)! es log AX 


NKr(1 + 1/r)™? (5-50) 


og 
© [2rq(1 + 2r)}'? 


1/2 1/2 
r) q l 


That the value of J(\) differs from (5-49) by O(q-/?) may be seen as 
follows. Since 0 < exp [--8?/2q] < 1, the integral over (we, «) (mentioned 
just above (5-46) and obtained by taking the limits of integration to be zw 
and « in the left side of (5-47)) is positive and less than 


» © 20 


| exp [—e" “| dw = é* dx/x = 219... (5-51) 
wo v1 


Likewise, the second integral on the right side of (5-47) is less than 
al 


| (1 — exp[— e* “°]) dw (1 — e€ *) dx/x = .796... (5-52) 
J x Jo 


Therefore the contribution of the first integral on the right of (5-47) differs 
from J(A) by a quantity less than 


[Die ""?(.219 + .796) dy = O(g""”) 

in absolute value. The contribution of the first integral on the right of (5-47) 
differs from (5-49) by the second integral in (5-48) which is O(q~!/?) because 
it is less than 


a 0 

] Diy Dye"? dy 

The factor (y°D) arises from wy» — (— log AX) when the mean value theorem 

is applied to the integral in w. Hence J(\) differs from (5-49) by O(g-!/*) 
Although (5-49) is a sufficiently accurate expression of J(A) for our pur- 
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poses, it seems worthwhile to set down approximate expressions for the 
terms which have been dismissed as 0(g-!/?). From the above work, 


J(x) = (1 + erf B)/2 + D, | dy ere e 4 exp [— &””*] dw 


wo 
- / e 971 — exp [— e” **]) dw 


+ i ee dy 
'—log Ad 


~~ (1 + erf B)/2 + Ds | dye? {~577.. + yD} es! 


1/2 
= (1+ erf B)/2+ (+) [— 577 --- + 
4xq 


41 + r)7f1 + (2 + 4) Be 


where 6; = y + (1 + 1) log AX and we have made use of the fact that 
8?/2q changes relatively slowly in comparison with w when q is large. 

Since J(A) differs from (1 + erf B)/2 by 0(qg~'”), and since the three B’s 
for \ equal to Ay, 1, and Az differ by not more than O(qg~'? log (A2/A1)) = 
O(q7 log? g), from (5-50) and (5-38), it follows that the terms involving 
J(d) and J(\2) in (5-43) may be included in the term O(g~? log??? g). In 
using our result it is more convenient to deal with V and K + 1 instead of 
g = N — 1/2 and K. Hence instead of B we deal with H defined by 


_1 (+n) | (K +101 + 1/7)" “(1 +7) 


T= 3@ 41/8 ~ x(q + 1/20 + 2r) 


(5-54) 


The difference B — H, with X = 1 and H finite, may be shown to be (with 
considerable margin) 0(1/K) + O(g~'"). From (5-43), as amended by the 
first sentence in this paragraph, it follows that 


Prob. (P,Q, --- , PeQ > PoQ) = (1 + erf H)/2 + 0(1/K) + 0(¢-'” log?” g) 
(1-4) 


where the difference between erf B and erf H has been absorbed by the 
“order of” terms. When g + 1/2 is replaced by N in (5-54) the result is ex- 
pression (1-5) for #. 


retest Dele: saan ecynscnane Seen ee ae 


=v Ee RENEE 
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APPENDIX I 


CUMULATIVE DISTRIBUTION FUNCTION FOR A SUM OF SQUARES OF NORMAL 
VARIATES 


Let x be a random variable defined by 


M 
x= > ¥ (A1-1) 
n=] 


where y, is a random variable distributed normally about its average value 
¥, With unit standard deviation. In writing (Al — 1) we have been guided 
by (4-3), where M = 2. + 1, but here we shall let M be any positive integer. 
In much of the following work M/2 occurs and for convenience we put 


m= M/2 (A1-2) 


From the work of Section 4 it follows that the probability density p(x, u) 
of x is given by Fisher’s expression 


p(x, u) = 2-'(x/u)m!2-V2 Tn s[ (ux) 2]e—t#)/2 (A1-3) 


where u is the constant 
(A1-4) 


Here we are interested in the cumulative distribution function, ie., the 
probability that x is less than some given value x9, 


P(xo, u) = / p(x, u) dx (A1-5) 
Jo 


as M becomes large. In this case the central limit theorem tells us that 
p(x, u) approaches a normal law with average x = M + wand variance = 
ave. (x — x)? = 2M + 4u. The function P(xo, u) has been studied by J. I. 
Marcum in some unpublished work, and by P. K. Bose(9). In particular, 
Marcum has used the Gram-Charlier series to obtain values for P(xo, “) in 
the vicinity of Z for large values of M. However, since I have not been able 
to find any previous work covering the case of interest here, namely values 
of P(x, 4) when x is appreciably less than x, a separate investigation is 
necessary and will be given here. 

Integrating the general expression (4-5) with respect to x between —X 
and xo, letting Y — «, and discarding the portions of the integrand which 
oscillate with infinite rapidity gives 
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1 es) 
2ri —e, above d 
1 2 


2ri —2, below 0 


—1 —tzrg 
3 e 


P(xo, u) = fave. e”’] dz 

(A1-6) 

—1 — izrg 

= 1 se” °[ave. e”*] dz 
where the subscripts “above 0” and “below 0” indicate that the path of 
integration is indented so as to pass above or below, respectively, the pole 
at s = 0. The value of ave. exp (isx) may be obtained by setting V + 1/2 
= m in (4-5). The new notation 


vo = Ms = 2ms, u = 2mt, 2 i (Al1-7) 


enables us to write 


1 -s , 
P(xo,u) = — ~—. ¢-lexp m[—ist — log (1 — ig) 
2ri 2, aboved 


—i+e1 — it)”') df. 


The further change of variable 


carries (A1-8) into 


2mi J 


P\xa,%) = : [ (1 — v) ' exp [mF(v)] dv (A1-10) 
2 K 


where the path of integration A is the straight line in the complex v plane 
running from 1 + ix to 1 — 7% with an indentation to the right of » = 1, 
and 


F(v) = sv — logv + t/v—s — 1. (A1-11) 


The A used here should not be confused with the A denoting the number 
of messages in the body of the paper. We have run out of suitable symbols. 

An asymptotic expression for (A1-10) will now be obtained by the method 
of “steepest descents.”” The saddle points are obtained by setting the 
derivative 


F'(v) = s — 1/0 — t/v? (A1-12) 
to zero and are at 
[1 + (1 + 4st)'/?]/2s 


[1 — (1 + 45¢)'/*]/2s 
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As xo and s increase from 0 to ~, uw and / of course being fixed, we have the 
following behavior: 
— 
s 
VY 
ve = —1 —1/(1+ 2) 


[t is seen that 2, > 0 and v < 0. 

Putting aside for the moment the factor (1 — v)~! in (A1-10), the path of 
steepest descent through the saddle point 2, is one of the two curves specified 
by equating the imaginary part of F(v) to zero. Introducing polar coordi- 
nates gives 


i0 
v = pe 


Real F(v) = (sp + t/p) cos? — logp—s—t 
Imag. F(v) = (sp — t/p) sin@ — 6 
At 1%, 0 = 0, p = 0. Imag. F(x) = 0 and, from (A1-12), 
Real F(v1) = (2s, — 1) — logn. — s — 1 


(A1-16) 
= (1 + 4s/)'? — logy, —s— 


The path of steepest descent through 2; may be obtained in polar form 
by solving 


(sp — t/p) = 6/sin 0 (A1-17) 


for p as a function of @. Setting ¢ = 6 csc 6 and taking the positive value of 
p leads to 


p=[et+ @ + 4st)!?]/2s (A1-18) 


As 6 increases from 0 to 7, g increases from 1 to ~, and p starts from 2 (as 
it should) and ends at ~. Thus, the path of steepest descent through 2 
comes in from v7 = — » + im/s (when@ isnearly 2, p + ¢/s,¢ & a/(a — 8) 
and p(x — 0) m/s), crosses the positive imaginary v axis and bends down 
to cut the real positive v axis (at right angles) at 1, and then goes out to 
vy = —« — im/s along a similar path in the lower part of the plane. It thus 
avoids the branch cut (which we take to run from — « to 0) in the a plane 
necessitated by the term log v in F(v). Since m and s are positive the path of 
integration K in (A1-10) may be made to coincide with the path of steepest 
descent when 2; > 1. This corresponds to the case in which x < x as (A1-14) 
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shows. When 0 < 2; < 1, i.€., © > x» > #, the two paths may still be made 
to coincide but it is necessary to add the contribution of the pole at » = 1 
as K is pulled over it. This is equivalent to passing from the first to the 
second of equations (A1-6). The path 6 = 0 which makes Imag. F(v) of 
(A1-15) zero turns out to be the curve of “steepest ascent’? and hence need 
not be considered. As (A1-13) shows, the saddle point v2 does not enter into 
our considerations because it lies on the negative real v axis and the path 
of integration K in (A1-10) cannot be made to pass through it without 
trouble from the singularity of F(v) at v = 0. 

We now suppose x) < x so that s and / are such as to make x; > 1. In 
order to remove the factor (1 — v) from the denominator of the integrand 
in (A1-10), we change the variable of integration from v to w: 


v—1 =e”, (1 — 2)“dv = —dw 
(A1-19) 


P(x, u) = — x | ex [mF(1 + e”)] dw 


As » comes in along the path of steepest descent, the path of integration L 
for w comes in from w = «© + im and dips down towards the real w axis 
as arg v decreases from 7. L crosses the real w axis perpendicularly at the 
point 

w, = log (% — 1) (A1-20) 


and then runs out to w = «© — im along a curve which tends to become 
parallel to the real w axis. w; may be either positive or negative. When 2 
is almost as large as x, w; is large and negative. 

Since F(v) is real along the path of steepest descent, F(1 + e”) is real 
along L. This real value is — © at the ends of Z and attains its maximum 
value F(v;), given by (A1-16), at w = wy. w; is a saddle point in the complex 
w plane because 


< F(i+e") = F’(it+e")e” = F'(ve” (A1-21) 
w 
vanishes at w = w). 
Instead of F(1 + e”) itself we shall be concerned with 
r= F(1+ e') — Fil +e") (A1-22) 
so that (A1-19) may be written as 
exp {mF(1 + e"')] | oo dw. 
2ri L 


The variable 7 is real on the path of integration L, is zero at w,, and in- 
creases to + « as we follow L out tow = © + im. It is convenient to split 


P(xo,u) = — (A1-23) 
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K into two parts (10). The first part connects « +. im to w; and the second 
part connects w; to «© — im. The values of w on these two parts will be 
denoted by w; and wy;, respectively. Corresponding to each value of 7 there 
is a value w, and a value w7, (in fact it turns out that wy, is the conjugate 
complex of w,;). Changing the variable of integration in (A1-23) from w to 
7, and remembering that K starts at 2 + im, gives 


Psy a) ~ See [ oo” 3 ee... wn] dr (A1-24) 
fy dr dr 


2ri 


Since m is large, most of the contribution to the value of the integral 
comes from around tr = 0 or w = ~). In order to obtain an expression for 
the integrand in this region we note that, because F’(v;) = 0, the Taylor 
series for (A1-22) is of the form 


T= —b(w — wy)? — d3(w — wi)? — by(w — wit — +++ (A1-25) 


The circle of convergence of this series is centered on w; and extends out to 
w = -+tim, these points being the nearest singularities of F(1 + e”) as may 
be seen by setting v = 1 -+ e” in (A1-11) and observing that the singularities 
of log v — ¢/v in the finite portion of the w plane occur at odd multiples of 
+ir. We imagine the branch cuts associated with log v to run out to the 
right from these points along lines parallel to the real w axis. Since (A1-25) 
has a non-zero radius of convergence, the same is true of the two series ob- 
tained from it by inversion, namely 


+y—1/2 1/2 i942 

Wr — WwW, = tbe “rt ~ + b3r/2b5 
ss Sy ges (A1-26) 
+ i[b> “Db, oe 5bo°d3 4]r se 2b. “+p otee 
and the series for w;; — w, obtained from (A1-26) by changing the sign of 
i. Differentiation of these two series gives a series for d(w,; — wy,;)/dr which 
also converges for sufficiently small | 7 | (putting aside the term in 7~!/*), 

and which, when put in (A1-24), leads to 

{mF (vy) 


P(x0, u) ~ 


m\i > Ms [b2°b, — 5b2°b3/4] + } (A1-27) 
| 4m 


€ 
(4armby) 


That this is an asymptotic expansion holding for large values of m follows 
from a lemma given by Watson (11). The conditions of the lemma hold 
since we have already shown that the series for d(w; — wy:)/dr converges 
for |r | small enough. Furthermore, d(w; — wy;)/dr is bounded for a < 7+ 
where 7 is real and 0 < a < the radius of convergence of (A1-26). This 
follows the fact that 


Iw dr |’ ae 
= |S] = [-—F’(1 + e”)e"] , 
dr 


dw 
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is bounded except near w = w (i.e., 7 = 0) and, indeed, decreases to zero 
like —e-”/sasw— © + im (i.e., r— @), 

The values of bz, 63, 6; obtained by expanding (A1-22) and comparing 
the result with (A1-25) are 


bz = F"(n)e™"!/2 
bs = [F’'"(n)e"! + 3F"(r:)e"'1/6 (A1-28) 
by = [F"" (ne! + OF’ (ne! + 7F"(m)e""|/24 

F"’(v) = v?+ 2, F’'"'(v) = —203 — tv, «= FP” (0) = 60-4 + 24 

Our asymptotic expression for P(x, “), when x» < x, is given by (A1-28) 

and (A1-27). Only the leading term of (A1-27) is used in the paper. Some- 

times the following expressions are more convenient than the ones which 

have already been given. 


by = ty °(0y + 2te™*/2 = vy °(r1 + 2W)(r1 — 1)°/2 


(1 — 1/0;)?(1 + 4st)!/2/2 (A1-29) 
F(m,) = (1 + 4st)? — 5s — t — log %. 


In all of these formulas 7; is given in terms of s and ¢ by (A1-13) and s and 
{in terms of x9 and u by (A1-7). 

When x) > x, the saddle point 2 lies between 0 and 1 in the v plane. As 
v follows the path of steepest descent (discussed just below equation (A1-18)) 
arg (v — 1) now stays close to w. From (A1-19) Imag. w stays close to 7 on 
the new path of steepest descent in the w plane, and the saddle point w, 
now lies on the negative real portion of the line Imag. w = x. The new path 
starts at w = «© + im, swings down a little as it comes in, swerves up to 
pass through w,; and then goes out to w = 2% + tm above the branch cut 
joining w = im tow = « + im. The analysis goes along much as for 1; > 1 
except that instead of being 0 the imaginary part of w, is ir. This causes 
the terms in 8; and 0, containing exp (3w;) to change sign. The numerical 
values of 5. and F(v,) are computed by the formulas (A1-29) as before. The 
fact that b, contains the factor exp (i27) shows up only in changing the sign 
of 6)'” to give the minus sign in the leading term: 


P(xo, u) ~ 1 — (4am) be |)~"? exp [mF (0,)] 


which holds for x» > Z. The one arises from the pole at »v = 1 and is the 
same as the one in the second of equations (A1-6). 

In order to see how (A1-27) breaks down near x» = x, we set x» —X% = 
2m(s — 1 — t) = —2me ors = 1+ ¢ — e where € is a small positive number 
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Using o = ave. (x — x)? = 4(m + u) = 4m(1 + 2%) it is found that 
v= 1+ €/(1 + 2) = 1 — 2(xo — X02 
mF (1) = —meé/(2 + 41) = —(x — %)*/20% 
2mb, = m(r, — 1)°(1 + 20) = (xo — %)2/o% 


and that, since w; — — ©, bs; — be and by — 762/12. When these values are 
put in (A1-27) the leading term becomes 


P(x, u) ~ (29) 2(0,/2) exp [—2?/203] 


and the term within the braces in (A1-27) reduces to 1 — o%/z° where z = 
— x9 > 0. Since the asymptotic expansion is useful only in the region where 
the second term within the braces is small in comparison with the first term, 
which is unity,  — xo must be several times as large as oz before we can use 
(A1-27). It will be noticed that the above expression for P(x, «) is closely 
related to the asymptotic expansion of the error function. 


APPENDIX II 
AN APPROXIMATION FOR I y(x) 


When z in the Bessel function J,(qz) is imaginary a formula given by 
Meissel (12) becomes 


ae oF (qy)* exp (qu + V) 
a) = cor(q + 1yw( + we lat 


where w = (1 + y’)' and V is a function of y and g-which, when q is large, 
has the formal expansion 


1f, 2-—3y)\ , y—4y’ 
iA = = danipennbiaiiieanll pe 
" “ey ws f | 16q?w 


~ 2 . 4 -.6 
— 1 fig — 16 + 1512y" — 3654y" + 375y"\ 
57609? | w 


Here we shall show that for y 2 0 and q > 1 


V | < 1/(2¢ — 1) (A2-3) 


Consideration of (A2-2) and also of the method used to establish (A2-3) 
indicates that the inequality is very rough. It doubtlessly can be greatly 
improved (but not beyond the 1/(12g¢) obtained by letting y and g — in 
(A2-2)). Incidentally, it may be shown that the constant terms which re- 
main in (A2-2) when y = © are associated with the asymptotic expansion 
of log T'(g + 1). 
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When (A2-1) is substituted in Bessel’s differential equation, which we 
write as 


- d P 
2 ¢ , : > — 2 1 , mi r”) = 0, 
Y oy I,(qy) + Yay T,(qy) — gq + yay) 


we obtain a differential equation for V: 
vy" on (4 pa y*)w-*/4 = (2qw + w*)y 1p’ 1m y’2 (A2-4) 


Here the primes denote differentiation with respect to y. The constants of 
integration associated with (A2-4) are to be chosen so that 


V — y*/(4g + 4) as y— 0. (A2-5) 


This condition is obtained by comparing the limiting form of (A2-1), in 
which w — 1 + y*/2, with 


(qy/2)* | or | (qy/2)" | gy | 
Sheep) me es hE he SE Sit aoe OE 
&) > rq + 1) q+ilrq+°? Lag +h 
Condition (A2-5) completely determines V since substitution of the 
assumed solution 


V = 4749 + 17? + ay + coy’ of oes 


in (A2-4) leads to relations which determine ¢, ¢2, --- successively. 
Let V’ = v. Then (A2-4) becomes 
v=c— lbv—v (A2-6) 
where c and 6 are known functions of y defined by 
c= (4—y)w'/4, b= (qw+ w?/2)y7 (A2-7) 


From (A2-5), » —> y/(2g + 2) as y — O and ‘therefore 


vy 
V= [ vdy (A2-8) 
Jo 


We first show that | v| < 1/(2g — 1) when g > 1. The (y, v) plane may 
be divided into regions according to the sign of vo’. The equations of the 
dividing lines between these regions are obtained by setting v’ = 0 in (A2-6). 
Thus, for a given value of y, 0’ 
v > v, or v < v2 where 


1 = —b+ (8+ c)'? = c/[b + (0 + c)'?] 
v2 = —b+ (Bh +c)” (A2-9) 


When y > 0 we have b 2 g. A plot of c versus y shows that| ¢| < 1. Hence, 


is positive if 1 < v < v and negative if 
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when g > 1, 
P+c2g—1>(q-— 1) 
|a| < 1/(2g — 1) (A2-10) 
% < —2¢+ 1 


The curve obtained by plotting 7 as a function of y plays an important 
role because, as we shall show, the maxima and minima of the curve for 7 
lie on it. Therefore, the maximum value of | v | cannot exceed the maximum 
value of | 7 |. The maxima and minima must lie on either the 7 or the v2 
curve since v’ vanishes only on these curves. In order to show that it is the 
2, curve we note from (A2-9) that, near y = 0, 0 behaves like y/(2q + 1). 
Consequently both the 7; and v curves start from v = 0 at y = O but fora 
while 7 lies above v which behaves like y/(2g + 2). Here v lies ina v’ > 0 
region and continues to increase until it intersects 7, (as it must do before 
y reaches 2 because 1; = 0 at y = 2) at which point vo = 0, 2; < 0, and 
has a maximum which is less than the maximum of | 7, | so v < 1/(2g — 1) 
when g > 1. Upon passing through 1, v enters a v’ < 0 region and decreases 
steadily until it either again intersects the 2, curve or else approaches some 
limit as y—> *. In either case | v | does not exceed 1/(2g — 1), since, in the 
first case v would have a minimum at the intersection and in the second 
2, ~ 0 as y — «. The same reasoning may be applied to the remaining 
points of intersection, if any, of the v and 2 curves. 

In order to obtain an inequality for V itself we rewrite (A2-6) as 

v” =c — (2b+ v)0 (A2-11) 
The solution of this equation which behaves like y/(2g + 2) as y — 0 also 


satisfies the relation 


y ey 
vy) = | c(x) exp | -| [2b(E) + v(£)] as| dx. 
“0 z 


as may be verified by making use of the relations c(x) — 1 as x — 0 and 


2b(é) — (2g + 1)/é, v(&) — E/(2g + 2) as E > O. For then 


7] 
— [ [2b(E) + v(&)| dE — (2g + 1) log x/y 


7 


vy) > | (x/y)"" dx = y/(2q + 2) 
0 


Hence, from (A2-8) 


ui 


}(y) = | dy | c(x) exp| | [2b(E) + v(€)] as] dx 
/0 “0 er 
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and 


Yi y uv 
[Tin | < ! dy | | ¢(x) | exp | -/ [2b(é) — | o(E) | d 
0 0 z 


From b 2 gand/v{| < 1/(2g — 1) it follows that 26() — | o(&) | > 2g — 1 
when g > 1. This and | c(x) | < (4 + 2°)(1 + 2°)-*/4 gives 


| Vin) | < I dy | (4+ x°)(1 + x) °4" exp [—(2g — 1)(y — x)] dx 


53 1 
16(2q — 1) . 2q— 1 


which is the result we set out to establish. The double integral may be 
reduced to a single integral by inverting the order of integration and inte- 
grating with respect to y. Incidentally, most of the roughness of our result 
is due to the use of the inequality for | c(.) |. 
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Realization of a Constant Phase Difference 
By SIDNEY DARLINGTON 


This paper bears on the problem of splitting a signal into two parts of like am- 
plitudes but different phases. Constant phase differences are utilized in such cir- 
cuits as Hartley single sideband modulators. The networks considered here are 
pairs of constant-resistance phase-shifting networks connected in parallel at one 
end. The first part of the paper shows how to compute the best approximation 
to a constant phase difference obtainable over a prescribed frequency range 
with a network of prescribed complexity. The latter part shows how to design 
networks producing the best approximation. 

PERENNIAL problem is that of designing a circuit to split a signal 

into two parts which are the same in amplitude but which differ in 

phase by a constant amount. A 90-degree phase difference is needed, for 
example, in the single sideband modulation system due to R. V. L. Hartley. 
It is well known that it is not possible to obtain exactly equal amplitudes 
and exactly constant phase differences at all frequencies except in the 
trivial special case of a 180-degree phase difference. Various methods have 
been devised, however, for approximating these characteristics over finite 
frequency ranges. The most obvious method is to use a pair of constant 
resistance phase shifting sections in parallel at one end and with separate 
terminations at the other end? as indicated in Fig. 1. 

This paper is devoted to the problem of obtaining approximately constant 
phase differences under the specific assumption that pairs of constant re- 
sistance phase shifting networks are to be used. The paper has been written 
with two objects in mind. The first is the development of a method for 
determining the best approximation to a constant phase difference which 
can be obtained over a prescribed frequency range with a pair of phase 
shifting networks of a prescribed total complexity. The second object is 
the description of a straightforward design procedure by means of which 
the networks can be designed to give this best possible approximation. 

The problem under consideration is typical of those usually described 
as problems in network synthesis. In other words, a network of a prescribed 
general type is to be designed to approximate as closely as possible an ideal 
operating characteristic of a prescribed form. The same procedure will be 
followed as that appropriate for most such problems. The procedure begins 
with the development of a mathematical expression representing the most 

1U'. S, Patent 1,666,206, 4/17/28, Modulation System. 


2 Another common method uses reactance shunt branches between effectively infi- 
nite impedances, such as the plate and grid impedances of screen grid tubes. 
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general characteristics which can be obtained with the prescribed type of 
network. This is followed by the determination of particular choices of the 
arbitrary constants in the expression, which will lead to the best approx- 
imation to the prescribed ideal characteristic. The next step is to deter- 
mine formulae for the degree of approximation to the ideal, which will be 


PHASE -SHIFTING 
NETWORKS 


e 
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Fig. 1—Phase-shifting networks for approximation to a constant phase difference. 
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Fig. 2— Variation in phase difference, when average is 90°, with a network of n sections 


obtained with those particular values of the constants. The final step is 
the development of a method for determining corresponding actual net 
works. 

From the optimum choice of constants, curves can be calculated which 
show what can be done with a network of any given complexity (Fig. 2). 
Then the complexity needed for any particular application can be read 
directly from the curves. The special choice of constants also leads to special 





96 BELL SYSTEM TECHNICAL JOURNAL 


formulae for element values of corresponding networks, using tandem sec- 
tions of the simplest all-pass type (Fig. 3). 


FORM OF THE tan (3) FUNCTION 
If 8; and B» represent the phase shifts through the two constant resistance 
- B» , 
networks of Fig. 1, then tan (*) and tan ¢ must both be realizable 


as the reactances of physical reactance networks. In other words, these 
quantities must be odd rational functions of w with real coefficients and 
must also meet various other special restrictions. If 8 is used to represent 


the phase difference 8B. — 8; , the function tan (5 


~ 


) mus also be an odd 


rational function of w with real coefficients. Because of the minus sign 


“OO0™ 


Fig. 3—Simplest all-pass section. 








’ ; ’ ees Pr. B 
associated with 8; in the definition of 8, however, tan (3 does not have to 


- faa By 
meet the additional restrictions which must be imposed upon tan ef and 


~ 


tan (*). In a later part of the paper a method will be described by which 


a pair of physical phase shifting networks can be designed to produce any 


) 


tan ( function which is an odd rational function of w with real coefficients. 
In any range where the phase difference 8 approximates a constant, the 


B , 
function tan (5 will also approximate a constant.: Hence, the present 


problem is really that of approximating a constant over a given frequency 
range with an odd rational function of w with real coefficients. In this prob- 
lem, the degree of the function must be assumed to be prescribed as well 
as the frequency range in which a good approximation is to be obtained, 
for the degree of the function determines the complexity of the correspond- 
ing network. 

W. Cauer shows how functions of certain types can be designed to approx- 
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imate unity in pres-ribed frequency ranges.* These functions, however, are 
not odd rational functions of frequency but are irrational functions appro- 
priate to represent filter image impedances or the hyperbolic tangents or 
cotangents of filter transfer constants. It turns out, however, that they 
can be transformed into odd rational functions of the desired type by a 
simple transformation of the variable. 

Each of Cauer’s functions is said to approximate a constant in the Tcheby- 
cheff sense, which means that in the prescribed range of good approximation 
the maximum departure from the approximated constant is as small as is 
permitted by the specifications on the frequency range and the degree of 
the function. Each function also has the property of exhibiting series of 
equal maxima and equal minima in the range of good approximation, such 
as those indicated in the illustrative 8 curve of Fig. 4. 


IN DEGREES 
oO o oO 
o = iv) 


PHASE DIFFERENCE, @, 
@ 
o 


8 
30 40 60 80100 200 400 600 1000 2000 4000 
FREQUENCY IN CYCLES PER SECOND 
Fig. 4—-Example of a phase difference characteristic. 


Of the various forms in which Cauer’s Tchebycheff functions F can be 
expressed, the following form is the one appropriate for showing how odd 
rational functions of frequency can be obtained: 


When 2 is odd 


When n is even 


k 
8 


Hs; 
Ve fw Ena) 


s=l 


U 





3 “Ein Interpolationsproblem mit Funktionen mit Positivem Realteil,”” Mathematische 
Zeitschrift, 38, 1-44 (1933). 
4 The data for the illustrative curve were obtained from a trial design carried out by 


P. W. Rounds. 
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In these equations, the symbol sm indicates an. elliptic sine, of modulus 
k, while K represents the corresponding complete elliptic integral. U is 
merely a constant scale factor, while » is an integer measuring the complex- 
ity of corresponding networks. In the case of phase-difference networks, 
n represents the total number of sections of the type indicated in Fig. 3, 
which are included in the two phase-shifting networks or their tandem sec- 
tion equivalents. 

In Cauer’s filter theory, the variable X represents a rational function of 
w which permits F to be an image impedance or a coth (5) function. In 
order that F may be an odd rational function of w, however, as is required 


when it is to represent tan (5) , X must be defined by the relation 


(2) w= wo/l — X2 


Cauer shows that F approximates a constant in the Tchebycheff sense in 
the range 0 < X < k. Hence, in terms of w, the range of approximation 
is w, < w < we, where w; and w, are arbitrary provided the modulus k is 
assumed to be determined by the relation 

(3) p= Veta oi 


Ww, 


, 8 
ALTERNATIVE EXPRESSION FOR THE tan 5 FUNCTION 


While equations (1) are the most convenient form of F to use in deriv- 
ing the transformation of the variable, an alternative more compact form 
is more suitable for determining the degree of approximation to a constant 
phase difference and the element values of corresponding networks. When 


3 ; R 
F represents tan (5) and hence w and X are related as in (2), the equivalent 


expression is as follows:° 


2) 7 U dame FA) 
tan ( U dn\{ nu —, kh 


w = w dn(u, k). 


In this expression, dn represents a so-called “dn” function, the third type of 
Jacobian elliptic function usually associated with the elliptic sine, or sn 
function, and the elliptic cosine, or cn function. The symbol wu represents 

5 This expression depends on a so called modular transformation of elliptic functions 
not found in the usual elliptic function text. The transformation theory may be found in 
“An Elementary Treatise on Elliptic Functions,” Arthur Cayley, G. Bell & Sons, Lon- 
don, 1895. 
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a “parametric variable” which would be eliminated on forming a single 
equation from the two simultaneous equations indicated. The modulus fy, 


4 is related to the modulus &, 


~ 


of the dn function corresponding to tan ( 


of the dn function corresponding to w, in the manner indicated below. The 
constant K,, of course, represents the complete integral of modulus &;, 
just as K represents the complete integral of modulus &. 

Corresponding to any modulus & there is a so-called modular constant g. 
Using qi to represent the corresponding modular constant of modulus fy, 
it is here required that 


(5) gq: = Q". 


One modulus can be computed from the other by means of this relation- 
ship and tabulations of logio g vs sin“ & which are included in most elliptic 
function tables.® 


DEGREE OF APPROXIMATION TO A CONSTANT PHASE DIFFERENCE 
When uw is real and varies from zero to infinity, the corresponding value 


of w as determined by (4) merely oscillates back and forth between the values 
w; and we. In other words, it merely crosses back and forth across the range 


L 


in which tan (5 approximates a constant. Similarly, when w is real and 


increases from zero to infinity, tan (3) oscillates between U'+/1 — k? and 
U. The equal ripple property of the curve illustrated in Fig. 4 is explained 


by the fact that the period of oscillation of tan (5) with respect to u is 


? 


) passes through several ripples 


‘ ‘ 8 
merely a fraction of that of w, so that tan( 


~ 


while the value of w moves from w; to we. 
Combining the formulae for the maximum and minimum values of 


tan (5) gives the relation 


(6) tan (5) «T= V1 — i) 
2 1+ U'V/1 — k 


6 When & is extremely close to unity, it may be easier to obtain accurate computations 
by using the additional relation 


logio (g) logic (g’) = (. " ) 
Be } 


. sey be 
where g’ is the modular constant of modulus Vil-—- B= ~ 


we earnest oe Re SERED CT NS 


“ 
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in which 6 represents the tota! variation of the phase difference 6 in the 
approximation range. Similarly, the average value 8, of 6 in the approxi- 
mation range is given by’ 
U(1+ V1 — #) 
1-UV/1 — k 

If the phase variation 6 is reasonably small, (6) and (7) can be replaced 
by the approximate relationships 


tan (8,) = 


sin (Ba) ,2 
7 A 


“~ 


Ba Pe = 9 
tan (*) =Uv1— ki} 


A still further modification is obtained by replacing k; by the quantity 16q, 
which is an approximate equivalent when &; is small, and by then replacing 
gi by the equivalent g” of (5). This gives 


ra lians 


(8) 


(9) = § sin (Ba)q” 
tan (4 =UV/i — 164". 


When combined with (3) and tabulations of sin~'(k) vs logio(qg) , these 
formulae can be used to compute 6 when the parameters w, w2, 8, and m are 
prescribed. Curves of 6 are plotted against w2/w; in Fig. 2, assuming 6, to 
be 90 degrees. 


DETERMINATION OF A NETWORK CORRESPONDING TO A GENERAL 
PHASE DIFFERENCE FUNCTION 


“ 6 , : : 
Since tan (’) must be an odd rational function of w, it can be expressed 


in the form 


é Py wB 
(10) tan (3) “s 


in which A and B are even polynomials in w. This requires 


B 
D) 


“ 


(11) = arg (A + iwB). 


7 More exactly, 8. is the average of the maximum and minimum values of 8 occurring 
in the range of approximation. 
*In the important special case in which the average phase difference Ba is 90°, this 


ee oe . 
expression for tan (% is exact rather than approximate. 
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Similarly, if attention is focused on the phase shifts of the individual 
phase-shifting networks rather than on the phase difference, the following 
odd rational functions can be introduced: 


tan (3 ~< 

2 A, 
‘ Be wB, 
tan i A 


in which Ai, Bi, Az, and Bz are additional even polynomials in w. This 
requires 


(12) 


arg (A; + iwB;) 
(13) 


arg (A> + twB,). 
It also requires 
(14) arg (A, — iwB,). 


Since the argument of a product is the sum of the arguments of the sep- 
arate factors, (13) and (14) require 


(15) B _ Be — Bi 


5) 5} == S5¢ (A> a tw B»)(Ay we iwB,). 
This permits us to write 
(16) (Az + iwB2)(A1, — iwB,) = H(A -+ iwB) 


in which #7 is a real constant. 


When tan (5) is prescribed, a corresponding polynomial of the form 


(A + iwB) can readily be derived. The problem is then to factor it into 
the product of two polynomials (Az + iwB2) and (A; — iwB,) such that 
A, By, Az, and By determine physically realizable phase shifts through 
(12). Two factors of the general form (A» + twB:) and (A; — iwB,) can 
readily be obtained in a number of ways. The only question is how to obtain 
them in such a way that the corresponding phase characteristics will be 
physical. A procedure meeting this requirement is described below. 

The variable w is first replaced in (A + iwB) by p representing iw. This 
leaves a polynomial in p with real coefficients, since A and B represent 
polynomials in w*, while p? represents —w*. Suppose all the roots of the poly- 
nomial A + pB are determined. Then this polynomial can be split into 
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two factors by assigning various of the roots to each of the two factors. 
It turns out that physically realizable phase characteristics will be obtained 
if all those roots with positive real parts are assigned to the factor (A; — pB,) 
which appears in (16) when iw is replaced by 4, all other roots being assigned 
to the factor (Az + pB). 

The physical realizability of the above division of the roots follows from 


PB, 


a theorem which states that = is realizable as the impedance of a two- 
z 


terminal reactance network whenever A, and B, are even polynomials in 
p with real coefficients such that A, + pB, has no roots with positive real 
parts.® From this theorem and the fact that the evenness of A, and B, 
causes them to remain unchanged when is reversed in sign, it follows that 


pB, 


= will also be the impedance of a physical two-terminal reactance net- 
4ig 
work whenever A, — pB, has no roots with negative real parts. Thus, by 


By 


(12) the above division of the roots of A + pB makes tan (* and tan 


Be ; . ; 
(4 realizable as the impedances of two-terminal reactance networks. 


These reactance networks and their inverses are merely the arms of unit 
impedance lattices producing the phase characteristics defined by (12). 
The above argument merely shows that each of the two phase-shifting 


By 
networks can at least be realized as a single lattice when tan (*') and 


39 . ‘ 
tan (3) are determined by the method described. Actually, they can be 


broken into tandem sections directly as soon as the roots of (Ai; — pBy) 
and (Ay + pBz) have been determined. From (A; — pB,), the quantity 
(4, + pB,) can be found by merely reversing the signs of the roots. Then 
by using the principle that the argument of a product is the sum of the 
arguments of the separate factors, phase-shifting networks can be designed 
corresponding to various factors or groups of factors as determined from 
the known roots of (A; + pB,) and (Az + pB2) . There can be a separate 
section for each real root and each conjugate pair of complex roots.!° 


DETERMINATION OF A NETWORK CORRESPONDING TO A TCHEBY- 
CHEFF TYPE OF PHASE DIFFERENCE CHARACTERISTIC 


The procedure described above for determining a network corresponding 
to a general phase difference characteristic is complicated by the necessity 


® See “Synthesis of Reactance 4-Poles which Produce Prescribed Insertion Loss Char- 
acteristics,” Journal of Mathematics and Physics, Vol. XVIII, No. 4, September, 1939— 
page 276. 

0 See H. W. Bode, “Network Analysis and Feedback Amplifier Design,” D. Van 
Nostrand Company, New York, 1945, Page 239, §11.6. 
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of determining the roots of the polynomial A + pB. In the case of the 
Tchebycheff type of characteristic described in the first part of the paper, 
the required roots can be determined by means of special relationships. 


In the first place, the roots of A + pB are the roots of (1 + 2 tan *) . In 


other words, by equation (4) they are the rots of 1 +i in nu af ; is) | 


The values of u at the roots turn out to have an imaginary part iK’, where 
K’ is the complete elliptic integral of modulus 1/1 — &. If a new variable 
u’ is defined by 

(17) u= u' + ik’ 


the roots can be shown to correspond to the values of u’ determined by 


(wf) 
sn \ nu K’ ky 
(18) = —U, 


K, ) 
=e 
on (mu K? 1 


If it is assumed that the phase variation is small in the range of approx- 
imation to a constant, it can be shown that one value of u’ determined 
by the above relation is given approximately by 

, 
nu’ 3 
(19) K ‘ies: — Ba 
where 8, is the average phase difference for the range of approximation as 
before (in radians). After this value of «’ has been computed, all the roots 


a K 
of E +1U dn (mu “Z ms) | can be found by computing the values of w 
’ 
corresponding to this value of «’ and to those values obtained by adding 
‘ ; a K ii ad 
integral multiples of the real period of an mu = , hs) This gives the 
n 


following formula for the roots in terms of p = ww. 


cn (25 +- Ws) 
(20) pum, 2 $ . ee 
(2x ) ‘ 
sn + Uy 


nv 


in which wo is the value of «’ determined by (19). 

Finally, instead of using the above elliptic function formula directly, one 
may replace the elliptic functions by equivalent ratios of Fourier series 
expansions of @ functions. This gives 


mm n= Vane q cos (3d) + 4 cos (5rs) + -- 
y = 1W2 


sin (Av) — g sin (3Ae) + ¢ sin (SAv) *** 
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in which the angle A, is defined by 


: a -180° — 38, 
(22) Ne = - “—" degrees, ++) (n — 1). 
Because all the p,’s are real in this Tchebycheff case, corresponding net- 
works can be made up of sections of the simple type indicated in Fig. 3. 
In one of the two phase-shifting networks there will be one section for each 
positive p,, and it will be given by 


eae 
Rops 


where Ro is the image impedance. Similarly, in the second phase-shifting 
network there will be one section for each negative p,, and it will be given by 


—1 
C= ——. 
Robs 





Conversion of Concentrated Loads on Wood Crossarms to 
Loads Distributed at Each Pin Position 


By RICHARD C. EGGLESTON 


NE of the most important requisites in all fields of engineering endeavor 
is knowledge of the strength of materials. The development of testing 
machines and techniques to study the basic properties of metals, plastics 


and wood products to withstand breaking forces has been a distinctive 
achievement during the last half century. All materials, whether they be 


part of a bridge, a building, a shipping crate, a telephone pole or a crossarm 
on a telephone pole, break under an excessive stress. To have accurate 
knowledge of the strength of the millions of crossarms used to carry the 
regular load of wires, which are frequently subjected to the extra loads of 
wind and ice, is most important in electrical communication. 

When strength tests of crossarms are made, the information most gen- 
erally sought is how great a vertical load equally distributed at each insulator 
pin hole will the arms stand. In the past many crossarm tests have been 
made by the concentrated load method, where the arm is either supported 
at each end and loaded at the center, or supported at the center and loaded 
at the ends until failure occurs (Fig. 1,a and b). Some have been made by 
the distributed load method by placing, manually and simultaneously, 50- 
pound weights in wire baskets suspended from each pin hole, and continuing 
such load applications until the arm fails. The method is objectionable 
chiefly because, in many of the tests, the loading is inadvertently carried 
past the maximum loads the arms will support. This objection was over- 
come in recent tests made by the Bell Telephone Laboratories', where the 
loads were also distributed at each pin position. However, instead of sub- 
jecting the 10-pin test arms to sudden 500-pound load increments (viz. 50 
pounds at each of the 10 pin holes), the loads were applied gradually by a 
hydraulic testing machine (Fig. 1, c). But, in spite of the advantages of 
this machine method of distributed load application, it is probable that, be- 
cause of the less elaborate apparatus involved in simple beam tests, there 
will continue to be tests made by the concentrated load method. 

Where tests have been made by the concentrated load method, the ques- 
tion arises how can the results be converted to a load-per-pin basis? A 
conversion is needed before a fair comparison can be made of all test results, 
and also to furnish the information generally most wanted, which is, as 

'Bell System Monograph No. B-1563, Strength Tests of Wood Crossarms. 
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previously stated, what load per pin will the arm support? There are more 
than twenty million crossarms in the pole lines of the Bell System and each 
year about a million arms are added. A complete understanding of every 
problem associated with this important item of outside plant material is 
manifestly worth while. This paper is intended to contribute to that end. 
It presents a solution of the problem of converting concentrated vertical 
loads to comparable loads distributed at each insulator pin position. 

The location of the critical section in crossarms is a basic factor in a study 
of the problem. The critical section of a crossarm is the section at which the 
fiber stress is greatest when the arm is loaded. It is the section where the 
arm may be expected to break if overloaded. To determine its location, the 
bending moment at various sections along the arm is divided by the section 
modulus of the respective sections. The quotient in each instance is the 
fiber stress for each section investigated. The location showing the greatest 
fiber stress is the critical section. Since horizontal shear is not the control- 
ling stress in crossarm failures under loads distributed at each pin hole, 
bending stresses only were considered in this analysis. 

Because of the differences in arm shape and in the spacing of pin holes, 
the location of the critical section is not the same in all arms. It is es- 
timated that at least three fourths of the arms in the Bell System are 10A 
and 10B crossarms.? Both are 10 feet long and 3.25” x 4.25” in cross section. 
In the 10A arm (Fig. 3), the space between the pin holes is 12 inches, except 
between the pole pin holes, where the space is 16 inches. In the 10B (Fig. 
4) the pin hole spacing is 10 inches with a 32-inch space between the pole 
pins. Both types zre bored for wood pins. Most of the arms now in the 
plant are ‘“‘roofed’’, that is, the top surface of the arm, except the center foot 
of length, is rounded on a radius of about 4.25 inches. Under the current 
design, however, the top surface of Bell System arms is flat, except for the 
edges, which are beveled. 

Previous studies of both roofed and beveled arms of various types have 
shown that the critical section of clear arms under vertical loads is either at 
the center or at the pole pin hole sections. This study is confined to those 
sections of clear 10A and 10B crossarms of nominal dimensions, both roofed 
and beveled. Moreover, it was assumed for the purpose of load analysis, 
that the crossarms are supported at the center only; since, under loads on 
each side of the pole, the standard crossarm braces provide no significant 
support when the loads are sufficient to break the arm. 


Roorep 10A ARM 


Let it be assumed that the breaking load concentrated in each end pin 
hole of a roofed 10A arm is 800 pounds. As shown in Calculation 1 in the 


210A and 10B crossarms were formerly known as Type A and Type B crossarms, 
respectively. 
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appendix, the bending moment at the center of the arm from the assumed 
loads would be 44,800 pound-inches, and the fiber stress at the center would 
be 4600 psi. That calculation also shows that the bending moment at the 
pole pin holes would be 38,400 pound-inches, and the fiber stress at the pole 
pin holes 7515 psi. Since the stress at the pole pin holes is greater than that 
at the center, the critical section of a roofed 10A arm is at the pole pin holes 
when the arm is subjected to a breaking load at each end pin hole. 

The information wanted, however, is what load at each of the ten pin 
positions would have produced the same moment and same fiber stress at 
the critical section? A /entative answer is found by dividing the 38,400 
bending moment by the “total-per-pin” lever arm*, 120” (see Calculation 
1) or 320 pounds. Checking to determine whether the location of the criti- 
cal section changes under loads of 320 pounds at each pin position, Calcula- 
tion 1 shows that the fiber stress at the pole pin holes and at the center 
would be 7515 psi and 5257 psi, respectively. Since the stress at the pole 
pin holes is greater, it is clear that the critical section is there also under 
equal loads at each pin position; and the 320-pound load per pin is com- 
parable to the concentrated load of 800 pounds at each end of the arm. 

If a similar investigation were made of a roofed 10B arm and of a beveled 
10A arm, it would be found that the pole pin hole section is the critical 
section of these arms; and that the load per pin comparable to concentrated 
loads at the arm ends would, like the roofed 10A arm, be equal to the bending 
moment at the pole pin hole section due to the concentrated load divided by 
the total per pin lever arm to that section. Figure 1b shows a roofed 10A 
arm that broke under test at a pole pin hole (critical) section from concen- 
trated loads at the ends of the arm. 


BrVELED 10B ARM 


For the investigation of this arm, let a breaking load of 1009 pounds at 
each end pin hole be assumed. Incidentally, it should be noted that so far 
as this analysis is concerned, the magnitude of the assumed concentrated 


loads is of no importance. However, since both computations and tests 
show the 10B arm to be stronger than the 10A, it seemed appropriate to 
assume a larger concentrated load for the 10B arm. 


As shown in Calculation 2 of the appendix, the bending moment at the 
center due to the 1000-pound load would be 56,000 pound-inches and the 
fiber stress 5882 psi, while at the pole pin holes the bending moment would 
be 40,000 pound-inches and the fiber stress 6885 psi. Here again, under 
concentrated loads at each end pin hole, the critical section is at the pole 
pin holes. 


*By total-per-pin lever arm is meant the summation of the distances from each pin 
position to the section concerned—in this instance to the pole pin hole section, 
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_ Calculating the load for each of the 10 pin positions in the same manner 
as for the roofed 10A arm, we have, tentatively, a load per pin of 400 pounds. 
However, in checking to determine whether the location of the critical sec- 
tion changes under loads of 400 pounds at each pin position, we obtain re- 
sults quite different from those in Calculation 1; for Calculation 2 indicates 
a fiber stress of 6885 psi at the pole pin holes, but a higher stress (7563 psi) 
at the center, which shows that the location of the critical section does 
change. Moreover, this change would occur whether the loads were 400 
pounds per pin or 4 pounds per pin. But let us now consider the 400- 
pound load. 

If a concentrated load of 1000 pounds results in a fiber stress at the pole 
pin hole section of 6885 psi and causes failure, that stress is the maximum 
ultimate fiber stress for the arm. It is, therefore, not reasonable to suppose 
that the same arm would have endured a higher stress (7563 psi) at the 
center if it had been loaded at each pin position. If 6885 psi is the maxi- 
mum stress for the arm,-the maximum moment it would endure at its center 
would be 65,500 pound-inches (viz. 6885 multiplied by 9.52, the section 
modulus of the center section). The maximum load per pin would be 364 
pounds (viz. 65,500 divided by 180, the total-per-pin lever arm to the center) ; 
and this load of 364 pounds, not 400 pounds, distributed at the 10 pin posi- 
tions is comparable to the 1000-pound concentrated load. Thus, while the 
critical section of a beveled 10B arm is at the pole pin holes when the load is 
concentrated at the arm ends, it shifts to the center when the load is dis- 
tributed at each pin position; and, moreover, the load is less than the load 
per pin tentatively computed. 

A graphic illustration of this shift of the critical section is shown in Fig. 2. 
Graph 1 in this figure is the graph of the resisting moments of a clear, 
straight-grained beveled 10B arm, 3.25” x 4.25” in cross-section, and having 
an assumed ultimate fiber strength in bending of 6000 psi. Each point 
in the graph is equal to the section modulus of the section under considera- 
tion multiplied by 6000 psi. Graph 2, which is the graph of a concentrated 
load at the end pin position, was drawn from the zero moment under the 
end pin to the point of greatest moment possible without intersecting re- 


sisting moment Graph 1. Since the point of coincidence between Graphs 


1 and 2 is the pole pin hole section, that section is the critical section for a 
concentrated load at the end pin. The magnitude of this concentrated 
load is equal to the resisting moment at the pole pin hole, 34,860 pound 
inches (viz. 5.81 inches’ x 6000 psi) divided by the 40” lever arm, or 871.5 
pounds. The load per pin, /en/atively figured, would be 34,860 pound- 
inches divided by 100 inches or 348.6 pounds. Graph 3 is the graph of a 
load (P) of 348.6 pounds at each pin hole. Under such loading, however, 
the bending moment at the center of the crossarm would be 62,748 pound- 
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inches (viz. 348.6 x 180), which exceeds the 57,120 pound-inches resisting 
moment at the center (viz. 9.52 inches* x 6000 psi). This means that the 
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Fig. 2—Moment diagrams for a beveled 10B crossarm: 
Graph 1—Resisting moments of a clear,straight grained, 3.25” x 4.25” arm. Fiber 
stress assumed to be 6000 psi; 
Graph 2—Bending moments from a concentrated load of 871.5 pounds at end pin hole; 
Graph 3—Bending moments from a load of 348.6 pounds at each pin hole; and 
Graph 4—Bending moments from a load of 317.3 pounds at each pin hole. 


arm would fail under such loading; and that the critical section of the arm 
under loads distributed at each pin hole is not at the pole pin holes but at 
the center of the arm. The maximum load per pin that the arm would 
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endure is the resisting moment at the center divided by the total-per-pin 
lever arm, or 57,120 pound-inches divided by 180 inches or 317.3 pounds 
Graph 4 is the-bending moment graph of this 317.3-pound maximum load 
per pin. 
SUMMARY 
= Concentrated load, 
P = Load per pin, 
Mp = Bending moment at pole pin hole section, 
fc = Fiber stress at center section, 
fp = Fiber stress at pole pin hole section, 
Sc = Section modulus of center section, and 
Sp = Section modulus of pole pin hole section. 


Using this notation, the results of the analyses may be summarized as 
follows: 


For 10A arms both roofed and beveled: 
Mp = 48W (for concentrated loads), and 
Mp = 120P (for pin loads). Therefore, 
_ 48W 


>= = 0.4W 
Pa = «04M 


For 10B arms-roofed: 
Mp = 40W (for concentrated loads), and 


Mp = 100P (for pin loads). Therefore, 


P= sa = 0.4W 


For the beveled 10B arm, however, where the critical section is at the 
center, the value P = 0.4W does not apply. The value of P would be such 
as to produce the same fiber stress at the center section as the fiber stress 
resulting from the concentrated load (W) at the pole pin hole section. Thus 


180P 
Sc 


fc = nd 


4( IW 
Sp 


fp = 
Equating these, we have 
180P . 40W 
—— = — and 
Sc 
40W ., Sc _ 2ScW _ 2 X 9.52W 


sine Sp “180 9p 9X58! 


= (),364W 
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Therefore, under the conditions assumed, and only under such conditions, 
we may say that the loads per pin (P) comparable to the assumed concen- 
trated loads (W) would be 

| 10A arms-—roofed 

P = 0.4W for 410A arms—beveled 
\10B arms—roofed 
and 


P = 0.364W for 10B arms—beveled 


While these results are restricted to the four arm types listed, the same 
principles followed in arriving at these results may be applied to other types 
and sizes of arms, and to other conditions of loading. Whether the conver- 
sion of single concentrated loads to loads per pin is performed by the method 
illustrated in Calculations 1 and 2 of the appendix, or is done by a moment 
diagram, as in Fig. 2, the procedure recommended is as follows: 

Step 1. Determine the critical section under the concentrated load. 

Step 2. Divide the bending moment at the critical section by the total-per- 
pin lever arm to the critical section to determine the load per pin. 

Step 3. Check the fiber stress (under such loads per pin) at various sections 
to see whether the location of the critical section differs under load 
per pin. 

Step 4. If it does differ, proceed as shown for the beveled 10B arm (viz., 
the comparable load per pin is equal to the resisting moment of 
the critical section divided by the total-per-pin lever arm to the 
critical section). If it does not differ, the load per pin as deter- 
mined in Step 2 is the comparable load per pin sought. 


CONCLUSIONS 


(1) The location of the critical section under loads distributed at each 
pin position must be determined before undertaking the conversion of 
concentrated loads to distributed loads. 

(2) The location of the critical section of a crossarm under a given condi- 
tion of loading may or may not be the same under a different condition of 
loading. 


(3) The load per pin comparable with a given concentrated load is equal 
to the resisting moment of the critical section divided by the total-per-pin 
lever arm to the critical section. 


(4) While the results shown are confined to the conversion of concentrated 
vertical loads to distributed loads for 10A and 10B arms only, the principles 
of the study may be applied to other types and sizes of arms and to other 
conditions of loading. 
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APPENDIX 


Calculation 1. Bending Moments and Fiber Stresses in a Roofed 10A 
Crossarm—(See Figure 3) 
Notation: 

800 pounds concentrated load 
Load per pin 
Bending moment at arm center 
Bending moment at pole pin hole 
Fiber stress at center 
Fiber stress at pole pin hole 
Section modulus of center section‘ 

= Section modulus of pole pin hole section‘ 


/-POLE PIN 
/—*HOLE 


8" he --=-12"--- men =- 12"---—>t- ----12"- ---Bya - --- 

















Fig. 3—Loading diagram for a roofed 10A crossarm. 


Concentrated Load: 
Mc = W X 56 = 800 X 56 = 44,800 pound-inches 
Mp = W X 48 = 800 X 48 = 38,400 pound-inches 
fc = Mc + Sc = 44,800 + 9.74 = 4600 psi 
fp = Mp + Sp = 38,400 + 5.11 = 7515 psi 
Load per Pin: 
Mc = 56P + 44P + 32P + 20P + 8P = 160P 
Mp = 48P + 36P + 24P + 12P = 120P 
(Note: The total-per-pin lever arms are 160” to center and 120” to the 
pole pin hole). 
Since under W load f is maximum at pole pin hole, the P load that would 
result in same f is P = 38,400 + 120 = 320 pounds. Thus 
fp = 120P + Sp = (120 K 320) + 5.11 = 5 psi 
fe = 160P + Sc = (160 XK 320) + 9.74 
Conclusion: 


Under both W loads and P loads, the critical section is the pole pin hole 
section. 


‘Sc = 9.74 and Sp = 5.11 jor clear roofed 3.25” x 4.25” crossarms. (See Pages 27 and 
28 of Bell Sys. Tech. Jour., Jan. 1945). 
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Calculation 2. Bending Moments and Fiber Stresses in a Beveled 10B 
Crossarm—(See Figure 4) 
Notation: 
W = 1000 pounds concentrated load 
P = Load per pin 
Mc = Bending moment at arm center 
Mp = Bending moment at pole pin hole 
fc = Fiber stress at center 
fp = Fiber stress at pole pin hole 
} Section modulus of center section® 
Section modulus of pole pin hole section® 


POLE PIN 
HOLE Ww 














ig. 4—Loading diagram for a beveled 10B crossarm. 


Concentrated Load: 
Mc = W X 56 = 1000 X 56 = 56,000 pound-inches 
Mp = W X 40 = 1000 X 40 = 40,000 pound-inches 
fe = Mc + Sc = 56,000 + 9.52 = 5882 psi 
fp = Mp + Sp = 40,000 + 5.81 = 6885 psi 
Load per Pin: 
Mc = 56P + 46P + 36P + 26P + 16P = 180P 
Mp = 40P + 30P + 20P + 10P = 1009P 
P = 40,000 + 100 = 400 pounds 
_ 100P _ 100 X 400 


- 7 = 6885 psi 
P= 3; 5.81 ns 


180 _ 180 X 400 _ 


fc = = 7563 psi 


=< 22 
Conclusion: 
Critical section shifts under P loads, and arm will not support 400 pounds 
per pin. 


5Sc = 9.52 and Sp = 5.81 for clear beveled 3.25” x 4.25” crossarms. (See Calculation 
3 of this appendix.) 
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Fig. 5—Beveled crossarm sections showing significance of the notation used in Calcu 
lation 3 of this appendix. 





Calculation 3. Section Modulus—Clear Beveled Sections 
(The notation used in this calculation is shown in Fig. 5) 


Section width (W) Inches 
Section depth (D) Inches 
V = (W — 1.25”) + 2 Inches 
U = V — .3248” Inches 
Areas: 


T Sq. Ins. 
Ri Sq. Ins. 
R2 Sq. Ins. 
R3 Sq. Ins. 


Total 1 Sq. Ins. 


t = h-+t (.1875” + 3) Inches 
rl = h + (.1875” + 2) Inches 
r2 = h — (1.59375” + 2) Inches: 
r3 = 1.78125” + 2 Inches 
Moments about MM: 
Tt Ins.* 
Rirl Ins.’ 
R2r2 Ins.* 
R3r3 Ins.* 

Total 2 Ins.3 
c = Total 2 + Total 1 Inches 
dt =iti-c Inches 
dri =ri—c Inches 
dr2 = r2 —c¢ Inches 
dr3 = c- 43 Inches 
Moments of Inertia: 

* i Ins.' 
TR1 Ins.* 
TR2 Ins.* 
TR3 Ins.‘ 
T (dt)? Ins.! 
Ri(dr1)? Ins.* 
R2(dr2)” Ins.4 
R3(dr3)? Ins.! 

I Ins.! 
y=D-¢ Inches 
Section Modulus: 

S=I[+y Ins.’ 


Center Section 


3.2 
4.2 


.0609 
.4876 
.1797 
.7891 
.5173 
. 1250 
. 1563 
. 2656 
.8906 


Oe, eK uw 


2512 
.0266 
.9148 
1558 


3484 
1141 
.0109 
.0422 
21515 
2151 


Ree th ht LS] DA bd 


.0001 

.0014 

1.0964 

1.5307 

[27 (dt)?] .2463 
2.0336 

6.8680 

8.5474 
20.3239 
2.1359 


9.52 


Pole Pin Hole 
Section 
3.25 





The Linear Theory of Fluctuations Arising from Diffusional 
Mechanisms—An Attempt at a Theory of Contact Noise 


By J. M. RICHARDSON 


The spectral density is calculated for the electrical resistance when it is linearly 
coupled to a diffusing medium (particles or heat) undergoing thermally excited 
fluctuations. Specific forms of the spectral density are given for several types of 
coupling which are simple and physically reasonable. The principal objective is 
the understanding of the frequency dependence of the resistance fluctuations in 
contacts, rectifying crystals, thin films, ete. 


1. INTRODUCTION 


HEN a direct current is passed through a granular resistance such as 
a carbon microphone or a metallic-film grid leak, or through a single 
contact, there is produced a voltage fluctuation possessing a component 
called contact noise which is differentiated from the familiar thermal noise 
component by the fact that its r.m.s. value in any frequency band is roughly 
proportional to the magnitude of the average applied voltage, and is differ- 
entiated from shot noise by the strong frequency dependence of its spectral 
density. One may regard this component of the voltage fluctuation as aris- 
ing from the spontaneous resistance fluctuations of the element in question 
if one is willing to allow the resistance to have a slight voltage dependence. 
This effect has been the subject of numerous experimental investigations,’ * 
me J ~ : a > a > 9 
among which we mention in particular that of Christensen and Pearson 
on granular resistance elements. These authors (henceforth abbreviated 
as CP) arrived at an empirical formula, to be discussed presently, connect- 
ing the contact noise power per unit frequency band with the applied volt- 
age, the resistance, and the frequency for several types of granular resistance. 
Their measurements covered a range of frequency from 69 to 10,000 cps, 
and involved the variation of several other parameters, i.e., pressure. More 
recently, Wegel and Montgomery’® have measured the noise power arising 
1H. A. Frederick, Bell Telephone Quarterly 10, 164 (1931). 
2A. W. Hull and N. H. Williams, P/rys. Rev. 25, 173 (1925). 
5R. Otto, Hoclifrequenztechnikund Elektroakustik 45, 187 (1935). 
4G. W. Barnes, Jour. Franklin Inst, 219, 100 (1935). 
5 Erwin Meyer and Heinz Thiede, E. N. T. 12, 237 (1935). 
6 F. S. Goucher, Jour. Franklin Inst. 217, 407 (1934). Bell Sys. Tech. Jour. 13, 163 
(1934). 
7J. Bernamont, Annales de Pliys., 1937, 71-140. 
8M. Surdin, R. G. E., 47, 97-101 (1940). 


9C. J. Christensen and G. L. Pearson, Bell Sys. Tech. Jour. 15, 197-223 (1936) 
10 Private communication. 
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from single contacts and have obtained results in agreement with the CP 
empirical formula down to frequencies of the order of 10-' — 10~° cps. 
Significant theoretical work upon this problem has not been attempted 
until recently. G. G. Macfarlane has advanced a theory based upon a 
non-linear mechanism containing one degree of freedom which seems to be 
in agreement with the CP law. W. Miller’ has worked out a general theory 
of noise in crystal rectifiers. His theory is linear, contains essentially an 
infinite number of degrees of freedom, and is equivalent in many respects 
to the theory discussed in this paper; however, he has not succeeded in ob- 
taining agreement with the experimental data on crystal rectifiers (which 
satisfy approximately the CP law) for any of the specific models he used. 
The purpose of this paper is the calculation of the spectral density of the 
fluctuations of the electrical resistance when it is linearly coupled to a diffus- 
ing medium (particles or heat), or, mathematically speaking, is equal to a 
linear function of the concentration deviations of this diffusing medium. 
This diffusing medium undergoes thermally excited fluctuations and thereby 
causes fluctuations in the resistance. The motive behind this investigation 
was the understanding of the frequency dependence of contact noise dis- 
cussed in the following paragraphs, but at the present time it is apparent 
that this treatment in addition may apply to rectifying crystals, thin films, 
transistors, etc. The quantitative details of the coupling between the resist- 


ance and the diffusing medium are not considered here; in consequence of 
which, this work can hardly pretend to give a complete explanation of con- 
tact noise. However, important results are given concerning the relation 
between the spectral density of the resistance, on one hand, and the geom- 
etry of the coupling and the dimensionality of the diffusion field on the 
other. 


Now let us consider the CP empirical formula in detail. Let R be the 
average resistance’ of the contact (we will henceforth consider only contacts 
and will regard a granular resistance as a contact assemblage) and let R, 
(¢) be the instantaneous deviation from the average. By theorems I-3 of 
Appendix I, we can express the m.s. value of Rj as a sum of the m.s. values 
of R; in each frequency interval as follows: 

a) 
Ri = [| Slo) do, (1.1) 
“0 

'G. G. Macfarlane, Proc. Pirys. Soc. 59, Pt. 3, 366-374 (1947). 

To be published. 

'8 The resistance of a contact is composed of two parts: the “gap resistance’ and the 
“spreading resistance.” The term “gap resistance” is self-explanatory. The “spreading 
resistance” is the resistance involved in driving the electric current through the body of 
the contact material along paths converging near the area of lowest gap resistance. The 
measured contact resistance is the sum of these two parts. In some of the particular 
physical models considered in Section 5, R is taken to be the gap resistance necessitating 
ad hoc arguments relating gap resistance and total resistance. 
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where S(w) is called the spectral density of R; and w is the frequency in 
radians per second. Now in our notation the CP formula may be expressed 


S(w) = KV: her? ‘w, (1.2) 


where V is the applied d-c voltage across the contact, K is a constant de- 
pending upon the temperature and the nature of the contact, and a and 6 
are constants having values of about 1.85 and 1.25 respectively. CP state 
that the constant K is equal to about 1.2 X 10-"° in the case of a single 
carbon contact at room temperature. 

In this paper we will regard the nonvanishing of a — 2 as arising from 
a non-linear effect which should become negligible at a sufficiently low 
voltage, although this interpretation does not seem completely justified on 
the basis of the work of CP. Consequently we assume that a— 2 as V — 0 
in such a way that V*? -— 1. This is in keeping with the idea that the 
resistance fluctuations are truly spontaneous—at least for smal! applied 
voltages. 

Although Eq. (1.2) may represent the observations over a large range of 
frequency it must break down at very high and very low frequencies in 
order that the noise power be finite (or, in other words, in order that the 
integral (1.1) converge). 

One has several clues to be considered in looking for an underlying mecha- 
nism of the resistance fluctuations. First of al!, the mechanical action of 
the thermal vibrations in the solid e‘ectrodes of the contact seems to be 
unimportant because of the following reasons: (1) there are no resonance 
peaks in S(w) at the lowest characteristic frequencies of mechanical vibra- 
tion of the contact assembly; (2) S(w) becomes very large far below the 


lowest characteristic frequency; and (3), according to CP, R; is strictly 
proportional to V? when the fluctuations are produced by acoustic noise 


vibrating the contact, whereas Rj is proportional to V*~?, a ~ 1.85, when 
the fluctuations arise from the dominant mechanism existing in the macro- 
scopically unperturbed contact. One of the obvious mechanisms left is a 
diffusional mechanism. Such a mechanism does not violate any of the ob- 
servations to date and, furthermore, possesses a sufficient density of long 
relaxation times to give large contributions to S(w) near zero frequency. 

Evidence that diffusion of atoms (or ions) can be important in modulating 
a current is provided by the “flicker effect” in which the emission of elec- 
trons from a heated cathode is caused to fluctuate by the fluctuations in 
concentration of an adsorbed layer. We might suppose that contact noise 
is a different manifestation of the basic mechanism involved in the flicker 
effect. 

In view of these considerations it seems worthwhile to investigate in a 








120 BELL SYSTEM TECHNICAL JOURNAL 


general way a large class of models involving resistance fluctuations arising 
from diffusional mechanisms. In the next section we propose a general 
mathematical model embracing a class of linear diffusional mechanisms. 
In Sections 3 and 4 the consequences of the general mathematical model 
are obtained by the “Fourier” and “Smoluchowski’” methods, respectively, 
these alternative methods leading to identical results. In Section 5, the 
general results are specialized to several physical cases, some of which are 
introduced only for the purpose of providing some insight into the relations 
between the possible physical mechanisms and the resultant resistance 
fluctuations, and one of which along with its refinement is a successful! 
attempt to provide a theory of Eq. (1.2). Section 6. is a summary. 


2. THe GENERAL MATHEMATICAL MODEL 


The physical models which we consider in this paper are concerned with 
the fluctuations of contact resistance arising from a diffusional process. We 
are consequently led to consider the following general mathematical model 
embracing a rather extensive class of the physical models as special cases: 
Let us consider the instantaneous contact resistance R(t) to be related to 
the intensity c(r, 4) of some diffusing quantity as follows!®: 


G(R@) = [ FC, ctr, ) ar, (2.1) 


where 7 is a vector in two or three dimensional space depending on whether 
the diffusion takes place on a surface or in a volume, and dr is correspond- 
ingly a differential area or volume. The intensity c(r, /) may be either a 
concentration (in the case of diffusion of material in two or three dimensions) 
or a temperature (in the case of heat flow in three dimensions). In writing 
Eq. (2.1) we have evidently assumed that the contact resistance R(t) is 
independent of the applied voltage. Eq. (2.1) may of course allow a de- 
pendence on voltage through the quantity c; however, we will consider no 
processes involving a dependence of c on the voltage. These restrictions, 
strictly speaking, make the model applicable only in the limit of low applied 
voltages. 

Before proceeding further let us limit the treatment to the case in which 
the deviations of R and ¢ from their average values are sufficiently small 
for higher powers of these deviations to be neglected. Let 


RV) = R+ RAO, (2.2) 
c(r, 4) = E+ alr, 0), (2.3) 


4 That is, successful in so far as agreement with the form of Eq. (1.2) is concerned. 

16 A relation more general than R(t) = SF(r, c(r, t)) dr is required as one can see from 
considering the special case of a total resistance composed of a parallel array of resistive 
elements. 
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where R and Z are the average’ values of R(t!) and c(r, ‘) respectively. 
Evidently, R(t) = 0, and ¢(r, ¢) = 0. Introducing the expressions (2.2) 
and (2.3) into Eq. (2.1), expanding in terms of ¢,(r, ‘), and neglecting terms 
of the order of cj, we get 


Ri) = { f(r) alr, 0) ar, (2.4) 


_ Parc, 0) dG(R) 
no - [2] /[2P., 


The function f(r) defines the linear coupling between R; and c¢, and depends 
upon the specific physical model used. The non-linear terms neglected in 
Eq. (2.4) may be of importance under some conditions; however, we will 
not consider them here. Nevertheless, non-linear effects in the behavior of 
¢, itself are possibly important in determining the form of the power spec- 
trum of R,(/) in the neighborhood of zero frequency. 


3. THE FourreR SERIES METHOD OF SOLUTION 


In this section we consider the state of the diffusing system to be defined 
by the Fourier space-amplitudes c,(/) of a(r, ¢). The time behavior of cg(¢) 
will be described by an infinite set of ordinary differential equations con- 
taining random exciting forces according to the conventional theory of 
Brownian motion.” This method yields the spectra! density of R;(¢) directly. 

Now the diffusion process is assumed to occur in a rectangular area A; = 
I, X Lz or in a rectangular parallelopiped of volume A; = L; X Lz X Ls. 
In regions of the above types, if we apply periodic boundary conditions”, 
1(Yof) may be expanded in Fourier space-series as follows: 


‘ex(r, 2) = D0’ cy(te** (3.1) 
k 


where the components of & take the values 
k; = 2en;/L;,i = 1,---,», (3.2) 


in which n; are integers and v is the number of dimensions. The prime on 
the summation indicates that the term for k = 0 is to be omitted. This is 
required by the equivalence of the time and space averages of c (true for 
A, sufficiently large) and by the vanishing of the time average of q (by 
definition). 

‘6 The average values here may be considered as either time or ensemble averages but 
not space averages. 

'7 See Wang and Uhlenbeck, Rev. Mod. Phys., 17, 323-342 (1945) 

'8 Tf the final results are given by integrals over k- space they will be insensitive to the 
boundary conditions. 
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Before proceeding to the solution itself let us consider what it is that we 
wish to know about c,(/). Expanding the function f(r) of Eq. (2.4) in a 
Fourier space-series in the region A, , 


f(r) = d fue’*", (3.3) 
we can write Eq. (2.4) in the form 


R(t) = A, + By fece(l) (3.4) 


where fg is the conjugate of fr. 
The spectral density S(w) of Ri(¢) is then 
Sle) = A? Do’ Cow (w)fater, (3. 
kk’ 


where Cgg (w) is the spectral density matrix for the set cg(/) given by 


Cer (w) = 27 Lim ; [ce(w, r)cR(w, 7) + ce(—w, ce (—w, 7)] (3.6) 


tT70 T 


in which 


1 +r/2 isa 
Ce(w, 7) = a ce(le ne: dl. (3.7) 


—r/2 


For a full discussion of spectral densities and spectral density matrices see 
Appendix I. Consequently our objective in this section is the calculation 
of the maxtix Cgg: (w) deiined by Eq. (3.6). 

Now we assume that (7, ¢) satisfies the diffusion equation 


5 a(r, t) = Dyalr, ) + g(r, 0) (3.8) 


where D is a constant, V? is the Laplacian operator in two or three dimen- 
sions, and where g(r, /) is a random source function, whose Fourier space- 
amplitudes g,(/) possess statistical properties to be discussed presently. 
The random source function g is required for exciting c sufficiently to main- 
tain the fluctuations given by equilibrium theory. In the case of material 
diffusion the random source function g may be discarded in favor of a ran- 
dom force term of the form —D/ x T-¥-(f(é + «)], where V-f = divf, x 
is the Boltzmann constant, 7 is the temperature, and f is the random force; 
however, in the linear approximation these two procedures will give identical 
final results. In the case of heat flow it is understood that the diffusion 
constant is D = K/pC where K is the thermal conductivity, p (ic density, 
and C the specific heat. Eq. (3.8) as written is valid only for D a constant 
and ¢ small. 
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Introducing the expansion (3.1) and the expansion 
g(r, 1) do” ge(ie (3.9) 
k 


into Eq. (3.8) we obtain the infinite set of ordinary differential equations 


d 


7 ck) = — Dkcy(t) + geld), (3.10) 
¢ 


k=|k|, 


describing the time behavior of the Fourier space-amplitudes cg(/). The 
Fourier space-amplitudes g,g(/) are assumed to be random functions of / 
possessing a white (flat) spectral density matrix Cxg , independent of fre- 


tu 


= ° ° ° 
, Integrating with respect to time 


quency. Multiplying Eq. (3.10) by .. e 
27 


from —}r to + }r, and neglecting the transients at the end points of the 
r-interval, we obtain 


Lew, 7) 


(3.11) 
iw + Dk? 


Cel(w, 7) = 
where cg(w, 7) is given by Eq. (3.7) and gg(w, 7) is given by an analogous 
equation. Forming the spectral density matrices we get for the diagonal 
elements 


C o( Jj = 
RRO w? + Dk! 


The matrix Ggg can now be evaluated by the thermodynamic theory of 
fluctuations (See Appendix II). This theory gives 


XO ek’ 


cele (1) _ 4 5 


where 


\lifk = R’ 
See = 4 
0 otherwise, 
J fa’s 1 fa°e| 


( 
§ ‘== { — ? (3.14) 
lac! c=e - lac? e=e 


s and e being the entropy and energy, respectively, per unit area or volume, 
7’ the average temperature, and x the Boltzmann constant. In the case 
where c is the concentration of particles whose configurational energy is 


constant, s” = x/é. If c be the temperature 7 then s” = C/T? where C 
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is the heat capacity per unit area or volume. Now by a general theorem 
concerning spectral density matrices (see Appendix I) we have 


cece (0) = [ Cre (w) da, 
0 
giving finally by combination with (3.12) and (3.13), 
2 xDk° 


Grr’ = 
nw A,s”’ 


2 xD koe: 


Cer’(w) = TAS’ @t DE 


The spectral density S(w) of Ri(t) then becomes 


2 xA,D R’ | fk |’ 
Sw) = - ——— : —. (3.17 
(w) ee ) eee ( 17) 
If we are concerned with frequencies greater than a characteristic fre- 

quency 
wo = 4n°D/L? (3.18) 


where L is the smallest of L;, 7 = 1, --- , v, then the summation in (3.17) 
may be replaced by an integration giving 


OY sa pvt rie | | {(R) |’ k* dk 
S(w) = 2°" « 7 o + De 


: (3.19) 
§ 


where 
i 2s. —tk-r 
f(R) = (2 |, soe dr. (3.20) 


The integration in Eq. (3.19) is carried out over the entire v-dimensiona] 
k-space. If the range of the function f(r) is sufficiently small compared with 
the region A, , or if we let A, become indefinitely large, then the integration 
in Eq. (3.20) may be extended to all of v-dimensional r-space. 

It is perhaps revealing to rephrase Eqs. (3.17) and (3.19) in terms of 
distributions of relaxation times. In the theory of dielectrics we speak of 
the real part of the dielectric constant being equal to a series of terms 
summed over a distribution of relaxation times: 2, a;7;/(1 + 7iw?), if the 


distribution is discrete, or | a(r)rdr/(1 + 7*w*), if the distribution is con- 
0 


tinuous. In the above, a; is the weight for the relaxation time 7; , and, in 
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the case of a continuous distribution, a(7)dr is the weight for the relaxation 
times in the range dr containing r. In these terms Eq. (3.17) becomes 


S(w) = >> nt Se - (3.17a) 
k 1 +rtpw 


2 x 2 " 
dp = — 5, |\fe|- (3.17b) 
Ss 


Eq. (3.19) becomes 


- a(r)r dr 


S(w) = | 
we) tree 


vy wl 
a(r) = oro | | f(l/- Dr) |° dQ, (3.19b) 
in which 7 is the unit vector in the direction of k, dQ, is the differential 
“solid” angle in the v-dimensional k-space, and the integration is over the 
total solid angle (2x in 2 dimensions, or 47, in 3). 

It is of interest to calculate the self-covariance R,(t)Ri(t + «). In Appendix 
I, it is shown that the self-covariance above is related to the spectral density 
S(w) as follows: 


ROR + u) = [ S(w) cos uw dw. 
0 


Using S(w) in the form (3.17), Eq. (3.21) gives 
R(t) R(t + 0) = xA,, yr p | fe ‘See 
A 


u> 0; 


whereas with S(w) in the form (3.19) we get 


ROR + u) = (2n)" x/s” | | f(k) |e?" dk. 
The method of the next section yields the self-covariance directly. 


4. SMOLUCHOWSKI METHOD OF SOLUTION 


We call the procedure employed in this section the ‘‘Smoluchowski 
method” because it is based on an equation very closely analogous to the 
well-known Smoluchowski equation forming the basis of the theory of 
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Markoff processes.'* We set out directly to calculate the self-covariance for 
R,(t) which, by Eq. (2.4), is given by 


ROR At + w) = [[ separ da, he 1) dr’ dr. (4.1) 


Thus the problem is now reduced to the calculation of ¢(1’, Ac(r, t + 4). 

The quantity o(r’, t)c:(r, ¢ + u) is calculated in two steps. First we find 
haces (t+u) 
a(r,t + u) , the average value of c, at the point r at the time / + u 
with the restriction that c¢ is known at every point 7’ with certainty to be 
o(r’, t) at the time ¢ (assuming, of course, that « > 0). Then we find that 
the required self-covariance for c is given by multiplying the above 
ea (t+u) 
o(r, i + u) by a(r’, ¢) and averaging over-all values of c(r, ¢) at 
time /; thus: 

(t) 








ar’, Da(r,t+u)"™ = ar’, Da(r, t + u). (4.2) 
ee rE a gh 

Now we assume that c(r, ¢ + wu) is related to c(r’, ¢) by an integral 

equation, analogous to the Smoluchowski equation, as follows: 


peng 
a(r,t + u) = | po(jr —r’ |, w)c(r’, t) dr’. (4.3) 


In the case that c represents a concentration as in the diffusion of particles, 
p(\r—r’ , «) dr is the probability that a particle be in the v-dimensional 
volume element dr at time ¢ + wu when it is known with certainty to be at 
r’ a time /. Now the number of particles in dr’ at r’ at time / is evidently 
|é + o,(r’, ()| dr’; consequently, the probable number of particles in dr at 
time / + wu which were in dr’ at time / is p(| r — r’ |, w)[e + a(7’, 2] drdr’. 


Integration over dr’ gives the total probable number 
(t+ u) 


(+ alr, t+ u) ) dr of particles in dr equal to 
( p(jr—r' |, wWlé+ alr’, | ir’) dr which reduces to 


(« + | 00 r—r’|, u)a(r’, t) ir’) dr. Division by dr and subtraction of ¢ 


from both sides of the equality leads directly to Eq. (4.3). For the case of 
heat flow in crystal lattices the above picture can be used approximately if 
one uses the concept of phonons.” For a diffusional process p{| r — r’ |, u) 
is the normalized singular solution of the diffusion equation”; thus 


p(jr—r|,u) = exp [—|r — r’ |*/4Dul] (4.4) 


1 
(4rDu)’” 
19 Loc cit. 

20 J. Weigle, Experientia, 1, 99-103 (1945). 
"1 Chandrasekhar, Rev. Mod. Phys. 15, 1 (1943). 
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where », as previously defined, is the number of dimensions of the region in 
which the process occurs. 
Combining Eqs. (4.2) and (4.3) we get 


FS ae Oe _ (bt) 


alr’, Dar, t + u) = / ar’, He(r’’, 2) p(ir—r’|,u)dr’’. (4.5) 
Now using the fact that 
al, Dal”, 
and using the relation 
alr’, alr’, ) = *, 5(r’ — r’’) 


proved in Appendix II, Eq. (4.5) reduces to 


alr’, Dalr,té+u) = 4 pir —r’|, u). (4.8) 
S 


+ 


Introducing the expression (4.8) into Eq. (4.1) we obtain at once the desired 
result 


R(OR(t + wu) = X [[ serge r — r’|, «) dr dr’ 
AY 
(4.9) 


on Pie Day ; | f(r) f(r’) exp [—|r — r’ |/4Du] dr dr’. 


For the sake of comparison with Eq. (3.23) it is necessary to write (4.9) 
in terms of the Fourier space-transforms of the pertinent quantities. We 
write 


f(r) = | f(k) e'®* dk 


{(k) = | f(r)e"*' dr. 
(2m)" 
Also, we write 


1 > 
(lIr—r'l,u) = ' xp{[—lr—r’'!’/4D 
ie (iaDuye &*P | wi 


= G, _ exp [—Duk* + ik-(r — r’)] dk. 
T © 


After introduction of these expressions into (4.9) a short calculation yields 
the result 


R(OR(t + u) = (24)” X | [f(k) Peo dk (4.10) 
§ 
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which is identical with Eq. (3.23) (provided that we let A, — © in the lat- 
ter). Thus the methods of approach used in Section 3 and in this Section 
are completely equivalent. 


5. SpecIAL PHysicAL MopELs 


In the previous two Sections we have developed by two different methods 
the consequences of the general mathematical model discussed in Section 2. 
Here we apply the general results to some special physical cases. In this 
task we will be principally concerned with finding the form of the function 
f(r) and establishing the number of dimensions » of the diffusion field. The 
main objective here is to provide some orientation on what mechanisms are 
or are not reasonable and to find at least one mechanism leading to the 
observed spectral density (inversely proportional to the frequency). 

a. A General Class of Models. Here we consider all at once mechanisms 
which can be adequately represented by having f(r) a v-dimensional Gaussian 
function of the form 


f(r) L ain ¢ 1/28 (5.1) 


where A?” is the “width” of the function measured along the i-th coordinate 
x;. This form of f(r) can represent approximately several types of localiza- 
tion of the coupling between R, and ¢, , as will be seen in the special examples 
later. Now if we work with A, = ©, we will then have to consider the Fourier 
space-transform of f(r), which is readily shown to be 


d ie ea 
f(k) = gy | fr ar 


II b;, Dae ibil? | 
i=1 


Inserting this result into Eq. (3.19) we obtain immediately 


2xD (i i) | exp (- Ay Ki) k’ dk 
as’ iol 2 


w? + D k4 


2 #. 


i=] 


Inserting this expression (5.2) into Eq. (3.23) gives 


7 _ b; 
R(t) Ri(t + u) = s/ I] (4n(A, i. Du)}'?’ u> 0 
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In order that (5.3) give the observed S(w) a 1/w as a result, the integral 


nust reduce to something proportional to / k dk/(w? + D*k*). It is clearly 


impossible that any choice of v and any set of A; can achieve this result. 
Furthermore the self-covariance R,(/)R,({ + u) corresponding to the ob- 
served S(w) should depend explicitly on the way S(w) deviates from 1/w 
as w goes to zero. The expression (5.4) is finite for all uw > 0 and does not 
depend upon any cut-off phenomena in S(w) at low frequencies. Therefore 
we can exclude any physical mechanisms belonging to the class considered 
here. However, since several mechanisms that have been proposed do fall 
into this class, we consider them below: 

(i) Schiff’s Mechanism. Schiff* considered tentatively that the fluctua- 
tions in contact resistance may be due to the variation in concentration of 
diffusing ions (atoms, or molecules) in a high resistance region bounded by 
parallel planes of very small separation. Schiff arrived at a noise spectrum 
proportional to 1/w but at the expense of disagreeing in a fundamental way 
with thermodynamic fluctuation theory. Here we will show what the cor- 
rect consequences of this mechanism are. 

Consider that the high resistance region is bounded on either side by planes 
parallel to the (a; , a2)-plane and that the thickness in the x;-direction is 
very small. Now this is obviously a case of the general model just considered 
in which we take vy = 3 and 


Ai.2 > Du, | 
A; <Du,! 


where 1/u is of the order of magnitude of the frequencies of interest. It is 
then a matter of algebraic manipulation to show that 
D bby ty ° ki dk 
Slo) >: <n a = oa 
s 2r A, As” “0 Ww + D ky 
x b; b3 5 


= 1/2 Sf? wb/2_ 2, 1/2, 1/2 
Page Pea, bs 


and 
x b; b3 b3 1 


ow SET aT u> 0. (5.7 
Di? s (4r)°? Ay? AS” ui? ” 


Ri(DR Ce + u) = 
Thus we see that Shiff’s mechanism leads to a noise spectrum proportional 
to 1/w'/?, not 1/w. The explanation of the singularity of the self-covariance 
(5.7) at « = O lies in the inequalities (5.5). 


20. I. Schiff, BuShips Contract NObs-34144, “Tech. Rpt. % 3’, (1946). Before the 
publishing of this paper, Schiff informed the writer that he has discarded this mechanism. 
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The above treatment could just as well be applied to the case in which 
the diffusing quantity is heat instead of ions. 

(ii) Resistance of a Localized Contact Disturbed by a Diffusing Surface 
Layer. Here we consider the case of two conductors covered with diffusing 
surface layers. It is supposed that the conduction from one conductor to 
the other is distributed Gaussianly with a width A’. Finally, it is supposed 
that the conductivity through the above area varies with the surface con- 
centration of the surface layer in that region. This situation is well repre- 
sented by the above general model by taking vy = 2, A; = A», = A, and 
b, = be = 8. 


The self-covariance is readily calculated with the result 


ph! 
x . 
Ss’ 4r(A + Du) 


Ri (R(t + u) = 
The corresponding spectral density is 


a 1 et cosuwdu 1 xb — cos (wA/D)Ci(wA /D) 
— age bo A+ Du 2x s’D ee “— 
(5.8a) 


+ sin (wA/D) (5 ~ siles/D)) | 
where Ci(x) and Si(x) are the cosine and sine integrals™ respectively. When 
w << D/A 

1 xb 


(wim — og 5.8 
S(w) ~ — 524 Jp log (604/D), (5.8b) 


6 = 0.5772, 
and when w > D/A 


1 x0'D 1 


S(w) » ere 
" 2s? s'A* w* 


(5.8c) 
Thus we see that this case does not lead to the experimental form of the 
spectral density. It must be remarked that here S(w) is very sensitive to 
the form of the self-covariance for small w. 

b. Contact between Relatively Large Areas of Rough Surfaces Covered with 
Diffusing Surface Layers. We consider this case in detail since it leads to 
results in agreement with experiment. Furthermore, the more detailed 
consideration of this case will illustrate more fully the use of the general 
mathematical model, which may be of use in studying other diffusional 
mechanisms should they be postulated at some future time. This mechanism 
does not fall into the class just considered. 


8 See Jahnke and Emde, “Tables of Functions,” p. 3, Dover (1943). 
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Suppose that the contact in an idealized form consists of two rough sur- 
faces close together. Let positions on the surfaces be measured with respect 
to a plane between the surfaces, which we will call the mid-plane. Let the 
coordinate system be oriented so that the x; and x» axes lie in the mid- 
plane. Furthermore let the region in the mid-plane corresponding to close 
proximity of the rough surfaces be a rectangular area Ay = LL, X Ly. Now, 
for convenience, we describe positions on the mid-plane by a two dimensional 
vector r = (x;, X2), and henceforth it will be understood that all vector 
expressions refer to this two-dimensional space. Let the distance between 
the upper and lower surfaces at r be denoted by /(r). The geometry of the 
above model is illustrated in Fig. 1. 

Now suppose that each surface is covered by a diffusing absorbed layer, 
such that the sum of the concentrations on both surfaces is c(r, /) at the 
time / in the neighborhood of r. Now consider the conduction of current 
between the surfaces. Let us assume that the conductance per unit area 
(of mid-plane) is a function of the separation / of the surfaces and the total 
concentration ¢ of absorbate near the point in question, i.e., /(h, c). The 
total conductance will be the sum of the conductances through each element 
of area: hence, the instantaneous resistance R(/) at time ¢ will be given by 


1/R(0) =| F(h(r), c(r, 0) dr (5,9) 
Ae 


where dr is the differential area on the mid-plane and the integration extends 
over the rectangle A, = 1; X Le. Behind the above statements lies the 
tacit assumption that the radii of curvature of the rough surfaces are gen- 
erally considerably larger than the values of . However, we will not explic- 
itly concern ourselves with this implied restriction. 

At this point it is expedient to imagine that we have an ensemble of con- 
tacts identical in all respects except for different variations of the separa- 
tion A(r). If we have any function of h, ¥(/) say, which we wish to average 
with respect to the variations of h, we simply average the function over the 
above ensemble giving a result which we denote by ¥(h) . . 

Now let us write 


h(r) = h°? + h(n), 


and, as before, 


Rt) = R+ RH, 
, ; (5.11) 
cy, () = €+ G(r, 2). | 


We assume that the ensemble average 4“ and the time averages R and é 
are constants independent of r and ¢. Let us also assume that the integrals 


a eT 
dh Ra, er Poa toe 


ea 
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of f(r) and ¢,(r, !) over A» vanish. Inserting (5.10) and (5.11) into (5 9) and 
expanding, we get 


1/R waa Ri()/R? 4+...-= As Fh, z) 


Z F ° " 3° F ° , sy (5.12) 
+ (3*) iE Iy(r)ea(r, 1) dr + 4 (54) is hi(r) dr +--+, 


where the super zero on the derivatives indicates that they are evaluated at 
h = h'°? and c = Z. In accordance with previous approximations in this 
memorandum we neglect™ terms of the order of ci and R}.. We also neglect 
terms of the order of /} . After taking the time average of (5.12) and sub- 
tracting the result from (5.12) we get 


R(t) = [ f(nalr, 0 i 


f(r) = al?’ h,(r), 


3] 
az eseiiiniietes 
Ohdc 


Thus we now have a special case of our general mathematical model for the 
number of dimensions v = 2, provided that we assume that the total con- 
centration c on both of the rough surfaces fluctuates in the same manner as 
the concentration of a single adsorbed layer confined to a plane rectangular 
surface. The spectral density S(w) of Ri(/) is then given by Eq. (3.17) which 
we repeat here 





9 
4 


S(w) = = (5.14) 


w? + Dk! 
where & is a two-dimensional vector whose components take the values 
ky = Inni/Li,ni = 0, + 1, 4 2,--- , and where f, are the Fourier space- 
amplitudes of f(r) given by 


ly Az | f(ne*** ar, 
Ag 


It may be appropriate at this point to consider the quantity s” in detail 
for this particular case. If the energy e per unit area is independent of c, 


> 


sal O's é ; 
we have s” = — a8 evaluated at c = ¢ where s is here the entropy of the 
ou” 


absorbate per unit area. For the sake of illustration let us consider a single 
layer of absorbate in which the molecules are non-interacting. If c, the sur- 


**We neglect these terms not because they are small compared with ¢; or Mic: but, 
because they are non-fluctuating (in time), are hence to be compared with 1/R. 
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face concentration of the absorbate, be measured in molecules (atoms, or 
ions) per unit area, then, for the ideal system above, it follows that s = —x 
c log c and finally that s” = x/é. However, in the mechanism discussed in 
this part we have a compound system consisting of two separate layers on 
the upper and lower surfaces respectively. Nevertheless, a detailed analysis 
reveals that with c equal to the sum of the concentrations of both layers 
we still have s”” = x/Z even though s itself is no longer given by an expres- 
sion the same as that above. In conclusion let us consider the factor x/s” 
in Eq. (5.14). This factor is under the above idealization simply equal to 
é. That is, the spectral density S(w) is directly proportional to the average 
concentration of absorbed molecules, meaning simply that each molecule 
makes its contribution to the resistance fluctuations independently of the 
others. Of course, in any real system this will not be quite true; however, 
the existence of interactions will be manifested only by making x/s” not 
equal to é in Eq. (5.14). 

The results quoted thus far apply to a system with a given /:(r). Now we 
shall average the right-hand side of Eq. (5.14) over the ensemble of varia- 
tions of A(r), it being supposed that S(w) itself on the left-hand side will 
be negligibly affected by this operation. This amounts to replacing | fy | 
by Tree We then have 

fe) = —-——(e) 
\fe\? = ah! | hep|? (5.15) 
where /y are the Fourier space-amplitudes of /;(r). 


We now consider more closely the problem of calculating | hy re We want 
to assume that /,(r) is a more or less random function of r. If /,(r) were a 
random function of r in the same way that the thermal noise voltage is a 
random function of the time /, then The would be a constant independent 
of k and the self-covariance hy(r)hy(1’) . would vanish for r # r’. This 
clearly cannot be so, since the function /;(r) with such statistical properties 
would represent a highly discontinuous type of surface incapable of physical 
existence. We then fall back upon the more reasonable assumption that the 
gradient of /, possesses statistical properties of the above type. This notion 
is precisely formulated by means of the following equations: 


Vin(r) = pr) (5.16) 
where 


/ p(r) dr = 0, (5.17) 


) 


pir)pir’)” = B1a(r — 1’). (5.18) 
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In Eq. (5.18) 6 is a parameter (with the dimensions of area) characterizing 
the amplitude of the surface roughness, and 1 is the unit tensor in two di- 


mensions. Expressing (5.16) in terms of the Fourier space-amplitudes hp 
and pz of /, and p respectively, we have 
—ik hp = pe, (5.19) 
giving finally 
——,(«) a——a(e) 3 ra, . 
lite|? =R-pepe + k/k (5.20) 
Expressing (5.18) in terms of Fourier space-amplitudes we get 


—- (e) = al Ss 
pede = BAr' Lone’, (5.21) 
which, when inserted into (5.20) gives the following desired result: 


an® 


liiaP = BAz' R?. (5.22) 


. - poe (8) . a ° ° 
Now replacing | fg |? by| /e |? in Eq. (5.14) and substituting the ex- 
pression (5.22) with the use of Eq. (5.15), we obtain 


: wi a 
S(w) = . a pa kt > 
e 3 


1 


k w” -b D4 (5.23) 


If the frequencies of interest are larger than a certain characteristic frequency 
w) = 42° D/L? where L is the smaller of 1; and Ly, the summation in 
(5.23) may be replaced by an integration giving finally 


as ate a 
S(w) = xD/n's"-c' BE As: 
ey ee | w? + Ds (5.24) 


= Xx 4s” Ba R*As-1/w 


This result is in agreement with experiment in most respects. The dependence 
on frequency is, of course, that experimentally observed by all investigators. 
The non-dependence on the voltage applied across the contact is implied 
by the basic assumptions common to all of the mechanisms considered here, 
and is in approximate agreement with the results of Christensen and Pearson 
(see Eq. (1.2)). For our result to agree with the results of CP as regards the 
dependence on the average resistance” &, the factor C4 A, must be pro- 
portional to R?+® where 6 ~ 1.25. These authors also imply that some of 

25 Tt must be remembered that the resistance R in the CP formula is the total contact 
resistance equal to sum of the gap resistance and the spreading resistance, whereas the R 
in our theory evidently should be considered the gap resistance. For the purposes of com- 


parison we make the ad hoc assumption that the gap resistance is proportional to the total 
contact resistance. 
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the parameters necessary to complete the description of a contact between 
given substances at a given temperature show up implicitly only through 
R. According to our theory the factor a? #* A, does not depend in any unique 
way upon R; it matters by what means # is varied. If the resistance RF is 
changed by altering the contact area A, , keeping other parameters fixed, 
we would find that RA» is constant so that the factor in question would be 
proportional to #*, that is, b = 1. However, if 2 is changed by varying the 
contact pressure, the effect would show up through the factor a’, (8 also, 
to some extent, perhaps) and, since one would expect @ to increase somewhat 
with pressure whereas P decreases with pressure, the factor of interest would 
probably depend upon some power of & between 3 and 4, that is, 1 < 6 < 2. 

The theory formulated here suffers from the difficulty that the integral 
of the power spectrum with respect to frequency is logarithmically divergent 
atOand ©, that is 


@2 
| S(w) dw = dw w= log (w2/w;) > © asa, >0and wa, >. 
@) 

The divergence at © does not bother us as much as the divergence at 0 
since, with only a divergence at ©, the self-covariance R,(t)R,(¢ + u) exists 
for all non-vanishing values of «; whereas, with a singularity at 0, the self- 
covariance does not exist for any value of «. For this reason we cannot con- 
sider the self-covariance here. In Part ¢ of this Section we consider a possible 
way of removing the divergence at 0, and consequently, then, we are able 
to calculate the self-covariance for non-vanishing values of u. 

c. Refinement of the Theory of Part b. Here we propose a simple modifi- 
cation of the model of Part b, removing the divergence of the integral of 
S(w) at w = 0. The modification considered here, although it is one of several 
possibilities any one of which is sufficient for removing the divergence 
(See Section 6.), is perhaps the only one that is sufficiently simple to treat 
in a memorandum of this scope. The results of this section are thus intended 
to be only provisional and suggestive. 

Let us reconsider the statistics of the function /(r) giving the separation 
between the surfaces near a point r on the mid-plane. The distribution of 
h’s considered in the last section is open to several criticisms: (1) it possesses 
no characteristic length parallel to the mid-plane; and (2) the self-covariance 
hy(r)hy(r’)“° does not exist for any value of r — 1’. 

To correct partially for these difficulties we replace Eq. (5.22) by 


(e) Be 


7 i. “7 
| Ak | All + 2°) 


where f is a new characteristic length. The self-covariance /,(r)/(1’) 


based upon (5.25) now exists for all values of r — r’ except 0. Thus we still 
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have the objection that the varianceh? is infinite; however, this will 
cause us no trouble. 

With Eq. (5.25) instead of (5.22) the spectral of density R; takes the form 
SOR kk 
o 1+ HR w+ Dk 
= (x /41s’") Ba’ R*Ay-1/w-O(y), 


2 
v(y ~ hog y) 
‘ TT 


My) = —__—_.___ , = t°w/D. 

VY i+? y / 
In obtaining the above equation we have made the usual assumption that 
the frequencies of interest are larger than w) = 4m°D/L’, and have replaced 
the original sum by an integral. The function Q(y) has the following proper- 
ties: 


S(w) = xD rs «Bae R‘A, - 





Oy) ~ —-y log y for y<i 
(5.27) 


Qty) ~1 for y>1 | 


Hence for w < D/€, S(w) « log w, the integral of which converges as w > 0; 
whereas, for w > D/£*, S(w) differs negligibly from that given by the unre- 
fined theory (Eq. (5.24)). 
The self-covariance Ri(¢) Ri(¢ + «) now exists for all non-vanishing u and 
is given by 
) ’ go Dubtnas 


bp 1+ 0R (5.28) 
= (x/4es’") Bo? RAs?" "| —Ei( —Du/e*)] 


where 


— Ei(—x) / e” dv/v, 


~ —logyx for x+<1, 


Y 





Thus for u< + 'D, Ry ()Ra(t + u) a — log (yDu/f) and for u > @/D, 
Ri (t)Ri(t + u) a 1/u. 
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Thus we have illustrated how one modification of the model has removed 
the divergence at w = 0. 

It appears from the treatment here and in part b that roughness and 
diffusion in two dimensions are essential (at least in a linear treatment) 
features in obtaining S(w)@ 1/w. In the case of a non-linear coupling (to be 
considered in a later paper) a ‘‘self-induced”’ roughness effect may occur 
without introducing roughness ab initio as an intrinsic feature of the model. 


6. SUMMARY 


(a) If the resistance deviation R,(/) is related to the concentration devia- 
tion ¢,(7, ¢) of a diffusing medium (particles or heat) by the /inear functional 


R(t) cd / f(na(r, t) dr, (6.1) 
Ay 


where r is a vector and dr a volume element in a v-dimensional space of 
volume A,, then the spectral density S(w) of R,(¢) is 


2 xA,D kl fe |’ 


eat ik w? + Dek’ 


(6.2) 
where D is the diffusion constant, s” is defined by Eq. (3.14), x is the Boltz- 
mann constant, w is the frequency (in radians per sec.), R is the wave number 
vector in v-dimensional k-space limited to a discrete lattice of points (defined 
by Eq. (3.2)) over which the summation is taken, and f, is the Ath Fourier 
component of f(r) (Eq. (3.3)). 

(a) If the important terms in (6.2) vary slowly between lattice points in 
k-space (true if w > wo given by Eq. (3.18)), then (6.1) can be replaced by 
the integral 


S(w) = tg” | F(R) |" Mak (6.3) 


w? + DR’ 


where the integration extends over the entire k-space and where /(k) is 
given by (Eq. 3.20) 


fk) = any | fie "® dr. (6.4) 
Ay 


(b) Let w’ be a frequency in the middle of a wide range. Suppose | f(R) |? 
averaged over the total solid angle in v-dimensional k-space is proportional 
to k’*, where n is an integer, in a wide range of k with k = V/w’/D in its 
middle. It follows then that S(w) « D”"”* w'*"*"” as long as -1 < 2n 
+v+1< 3. Asa consequence, we see that with m an integer (as is true for 
the simple cases considered in Section 5) » must be 2—the only even di- 
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mensionality—in order that S(w) be inversely proportional to w in agree- 
ment with experiment. In this case the only allowed value of n is —1. 

(c) From (b) we have the interesting result that S(w) is independent of D 
when it is inversely proportional to w. This means that very slowly diffusing 
substances can contribute as much to contact noise as rapidly diffusing 
substances. This result can be derived on quite dimensional grounds and is 
not dependent upon the special assumptions underlying our treatment. 

(d) A system comprising a high resistance layer modulated by the three- 
dimensional diffusion of particles or heat gives S(w) « w’”, See Case a.(i) 
in Section 5. 

(e) In a system composed of a localized contact disturbed by a diffusing 
surface layer (See Case a.(ii), Section 5), the self-covariance R,(/)Ri(¢ + u) 
is inversely proportional to A + Du where A may be considered the contact 
area. We have S(w) « — log a + const. for w «< D/A and S(w) « w? 
for w > D/A. 

(f) In a system involving the contact between relatively large areas of 
rough surfaces covered with diffusing surface layers (Cases b. and c., Section 
5), we have been successful in obtaining S(w) « w™, and also in obtaining 
a reasonable dependence upon the average resistance. 


APPENDIX I 
SPECTRAL DENSITY AND THE SELF-COVARIANCE 


Here we consider in detail the spectral density, the self-covariance, and 
the relation between these two quantities, first for the case of a single random 
variable. The treatment is subsequently extended to the case of a set of 
random variables which necessitates the consideration of the spectral density 
matrix and the covariance mairix. 

Let y(/)be a real random variable whose time average vanishes, y(/) = 0. 
Now the m.s. value of y can be defined 


3 0 ne a 
y(t) = Lim y(t, r) dl (I-1) 


T30 TT 20 


. ° 7, : ° . ° 
where y(/, 7) = y(¢) in the interval a ¢ < 5 and vanishes outside this 


interval. Evidently y(/, 7) can be expressed by the Fourier integral 


+00 
y(t, r) = i] s(w, re! dw 


~ 20 


where 
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By Parseval’s theorem we obtain 


+0 +00 
/ y(t, 7) dt = 2x | | y(w, 7) |° dw, 


i) 


which, when combined with (I-1), gives finally the desired result (using 
the fact that | y(w, 7) |? is an even function of w) 


y' (J) = | V(w) dw (I-3) 
0 
where 
E a 2 
Y(w) = 4m Lim — | y(w, 7) | 


tT 0 T 


is the spectral density. 
By a procedure not very different from the preceding, one can show that 


y(t) y(t +u) = [ Y(w) cos wu dw, (1-5) 
0 


) - —————————— 
Y(w) = — [ x06 + u) cos wu du. (1-6) 
T 


The quantity (v(/)y(¢ + «) is called the self-covariance. 

Now let us suppose that we have a set of random variables y,(/) which are 
in general complex and whose time averages vanish. We are then led to 
consider, instead of (1-3), relations of the form 


y. (ye (0) = [ Vi(w) dw (I-7) 
0 


. 


where now 


“a ae — 
Vi(w) = 22 Lim — [y,(a, ry? (w, tT) + ¥:(—a, r)y*( -w, 7)| (I-8) 
T-70 T 


in which 


1 sini: me 
. | y (le'* dt. 
dr (r/2 r 


Instead of self-covariances like y(/)y(¢ + «) we have to consider a covari- 





ance matrix of the form y;,(¢)y; (¢ + «). Since we shall not have occasion in 
this paper to consider the relation between the spectral density matrix and 
the covariance matrix we will not consider the derivation of the analogue of 


Eq. (1-5). 
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APPENDIX II 


THERMODYNAMIC THEORY OF FLUCTUATIONS 





The value of the quantity o(r, ¢)c,(r', ¢) or (cp(t)cgi(t)) is determined from 
equilibrium considerations. Before going into the above continuum problem 
let us first consider the problem for the case of a system described by a finite 
set of variables. More specifically let us suppose that the state of the system 
subject to certain restraints (i.e. fixed total mass and energy) is described 
by the set of variables x, , --- , x, . Let the equilibrium state be given by the 
values x}, °°: , x, , and let 


= x + Ay. (II-1) 
If the system is constrained to constant average energy E, the entropy of the 


non-equilibrium state § = S° + AS will be less than S°, the entropy of the 
equilibrium state, by an amount 


AS = 5D Siasay, (II-2) 
1) 


as 1 (dE 
OX; OX;/ 2, =z° OX; OX;/) 2,=2° 


Obviously, AS must be the negative of a positive definite quadratic form, 
otherwise the equilibrium state would not be a state of maximum entropy. 
The probability distribution” for the @’s is given by 


Play "Th Abs » &n) _ Ne*S!x (II-3) 


where V is a normalization factor and x is the Boltzmann constant. Averag- 
ing the products a; a; we find that 


DX Sisajar = xdix- (11-4) 
? 
Multiplying (II-4) by the arbitrary set y; and summing over i we get 
De viSij jan = x76. (1-5) 
) 


The generalization to a system described by a continuous set of variables 
is not difficult on the basis of (II-5). Now suppose that, in a v-dimensional 
space A, , we have a system whose state at time / is defined by the continuous 
set of values of the variable c(r, 4) = ¢ + c,(r, t); we have 


AS = —/ slr, ) dr (11-6) 
2 Ji. 


26H. B. G. Casimir, Rev. Mod. Phys. 17, Nos. 1 and 3, 343-4 (1945). 





LINEAR THEORY OF FLUCTUATIONS 


i as 1 (de 
free (F Z + T° (5 a 


when s and ¢ are the entropy and energy, respectively, per unit volume (of 
v-dimensional space). In calculating (II-6) it was assumed that 


/ o(r, 1) dr = 0, 
Ay 


expressing the fact that the system is closed. In order to put (II-6) into a 
form strictly analogous to (II-2) we write it 


AS «= 5 / / s’8(r — r')a(r, Da(r’, 2) dr dr’. (II-7) 
Ay MAy 


We see that the equation analogous to Eq. (II-5) must be 
| | y(r’)s” i(r — r)e(r’’, Dalr, 1) dr dr’ = xy(r) (II-8) 
A, VA, 


where y(r) is an arbitrary function. Integrating (II-8) with respect to r’’ 
and using the fact that the delta function is defined by 


[veneer —r) dr’ = y/(r) 
we readily arrive at the result 


a(r, De (r’, t) = 5 d(r — r’). 


Using the Fourier space-expansions of C; and 6(r) 


alr, 1) = Do’ cg(de'*, 
k 


1 | 
é(r) = —),e** 
M= Zee, 


in the region A, = 1X --- XL, with kj = 2nn;/L;, we can write (II-9) 
over into the equivalent expression: 


ce(Dce() = {yr Oke (12-10) 


say 


(1 if R= R’, 
Ser = | 
\0 otherwise. 





Abstracts of Technical Articles by Bell System Authors 


Audio-Frequency Measurements.' | W. L. Birack* and H. H. Scorrt. This 
paper indicates the theory involved in making measurements of gain, fre- 
quency response, distortion, and noise at audio frequencies, with particular 
emphasis on such measurements made on high-gain systems. There are 
also discussed techniques of measurement and factors affecting the accu- 
racy of results. This subject is not new art but has not previously been 
published in correlated form, to the knowledge of the authors. 

Growing Quartz Crystals” | E. BUEHLER and A. C. WALKER. The Bell 
Telephone Laboratories started an investigation of this subject in March 
1946, based on information gleaned from several investigators who visited 
Germany after the war, particularly Mr. J. R. Townsend of these Labora- 
tories, and Professor A. C. Swinnerton of Antioch College. After a relatively 
few experiments made with equipment similar to that used by Professor 
Richard Nacken in Germany, and with the process he described, it became 
apparent that Nacken had made substantial progress in the art of growing 
quartz at temperatures and pressures near the critical state of water, 1.e., 
about 374°C, and 3,200 pounds per square inch. This report summarizes 
further progress that has been made in the Laboratories since March 1946, 

Corrosion of Telephone Outside Plant Material’ | K. C. Compron and 
A. Menpizza. Problems resulting from corrosion in the telephone outside 


plant are many and varied. In this article an attempt is made to give a 


broad overall picture of these problems and the manner in which they are 
met and solved by the telephone plant engineer. 

Magnetic Recording in Motion Picture Techniques.s JOHN G. FRAYNE and 
Hatitey Worre. Development of magnetic recording at the Bell Telephone 
Laboratories is described with the application of such facilities to Western 
Electric recording and reproducing systems. A method of driving 35-mm. 
magnetic film with a flutter content not greater than 0.1 per cent is de- 
scribed, as is a multigap erasing head. 

Semi-Conducting Properties in Oxide Cathodes t+ N. B. Hannay, D. 
MacNarr, and A. H. Wutre. It has been widely assumed, without ade- 

' Proc. 1. R. E., v. 37, pp. 1108-1115, October 1949. 

* Of Bell Tel. Labs. 

2 Sci. Monthly, v. 69, pp. 148-155, September 1949. 

§ Corrosion, v. 5, pp. 194-197, June 1949. 

4S. M. P. E. Jour., v. 53, pp. 217-234, September 1949. 

5 Jour. Applied Physics, v. 20, pp. 669-681, July 1949. 

+ A reprint of this article may be obtained by writing to the Editor of the Bell System 
Technical Journal. 
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quate experimental verification, that barium-strontium oxide, as used in 
the oxide cathode, is an excess electronic semi-conductor. Accordingly, the 
electrical conductivity of (Ba,Sr)O has been studied as a function of tem- 
perature before and after activation with methane, extensive precautions 
being taken to exclude spurious effects. The increase in conductivity ob- 
tained characterizes (Ba,Sr)O as a “reduction” semi-conductor, and hence 
very probably as an electronic semi-conductor whose conduction electrons 
arise from a stoichiometric excess of (Ba,Sr) atoms in solid solution. 

A basic prediction of the semi-conductor theory has been tested quan- 
titatively with the finding that the electrical conductivity and the thermionic 
emission of a (Ba,Sr)O cathode are directly proportional through three 
orders of magnitude of activation; well-defined chemical and electrical 
activation and deactivation procedures were used in obtaining this result. 
It may be concluded that activation represents an increase in the chemical 
potential of the electrons in the oxide, little or no change in the state of the 
surface occurring. It has also been found that deviations from the propor 
tionality of conductivity and emission may be expected under conditions 


leading to inhomogeneity in the oxide, in agreement with the semi-conduc- 
tor theory also. 

Electron Microscope and Diffraction Study of Metal Crystal Textures by 
Means of Thin Sections. } R. D. Hetpenretcu. Bethe’s dynamical theory 


of electron diffraction in crystals is developed using the approximation of 
nearly free electrons and Brillouin zones. 

The use of Brillouin zones in describing electron diffraction phenomena 
proves to be illu inatin: since the energy discontinuity at a zone boundary 
is a fundamental quantity determining the existence of a Bragg reflection. 
The perturbation of the energy levels at a corner of a Brillouin zone is 
briefly discussed and the manner in which forbidden reflections may arise at 
a corner pointed out. It is concluded that the kinematic theory is inadequate 
for interpreting electron images of crystalline films. 

An electrolytic method for preparing thin metal sections for electron 
microscopy and diffraction is introduced and its application to the structure 
of cold-worked aluminum and an aluminum-copper alloy demonstrated. 
It is concluded that cold-worked aluminum initially consists of small, in- 
homogeneously strained and disoriented blocks about 200A in size. These 
blocks are not revealed by etching but would contribute to line broadening 
in conventional diffraction experiments. By means of a reorientation of the 
blocks through a nucleation and growth process, larger disoriented domains 
about 1-3y in size found experimentally could be accounted for. It is sug- 

§ Jour. Applied Physics, v. 20, pp. 993-1010, October 1949. 


+ A reprint of this article may be obtained by writing to the Editor of the Bell System 
Technical Journal. 
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gested that such a nucleation and growth reorientation phenomenon is re- 
sponsible for self-recovering in cold-worked metals. 

The formation of CuAl, precipitate particles is demonstrated with both 
electron micrographs and diffraction patterns. A fine lamellar structure found 
in the quenched Al-4 per cent Cu alloy is at present unexplained. 

Path-Length Microwave Lenses.’t Winston E. Kock. Lens antennas for 
microwave applications are described which produce a focusing effect by 
physically increasing the path lengths, compared to free space, of radio 
waves passing through the lens. This is accomplished by means of baffle 
plates which extend parallel to the magnetic vector, and which are either 
tilted or bent into serpentine shape so as to force the waves to travel the 
longer-inclined or serpentine path. The three-dimensional contour of the 
plate array is shaped to correspond to a convex lens. The advantages over 
previous metallic lenses are: broader band performance, greater simplicity, 
and less severe tolerances. 

Refracting Sound Waves.t Wrtxston E. Kock and F. K. Harvey. 
Structures are described which refract and focus sound waves. They are 
similar in principle to certain recently developed electromagnetic wave lenses 
in that they consist of arrays of obstacles which are small compared to the 
wave-length. These obstacles increase the effective density of the medium 
and thus effect a reduced propagation velocity of sound waves passing 
through the array. This reduced velocity is synonymous with refractive 
power so that lenses and prisms can be designed. When the obstacles ap- 
proach a half wave-length in size, the refractive index varies with wave- 
length and prisms then cause a dispersion of the waves (sound spectrum 
analyzer). Path length delay type lenses for focusing sound waves are also 
described. A diverging lens is discussed which produces a more uniform 
angular distribution of high frequencies from a loud speaker. 

Double-Stream Amplifiers.°} J. R. Prerce. This paper presents expressions 
useful in evaluating the gain of a double-stream amplifier having thin con- 
centric electron streams of different velocity and input and output gaps 
across which both streams pass. 

Direct Voltage Performance Test for Capacitor Paper.°+ H. A. SAUER and 
D. A. McLean. Performance of capacitors on accelerated life test may vary 
over a wide range depending upon the capacitor paper used. Indeed, at 
present a life test appears to be the only practical means for evaluating 

7 Proc. I. R. E., v. 37, pp. 852-855, August 1949. 

5 Acous. Soc. Amer. Jour., v. 21, pp. 471-481, September 1949. 

® Proc. I. R. E., v. 37, pp. 980-985, September 1949, 

Proc. I. R. E., v. 37, pp. 927-931, August 1949. 


+ A reprint of this article may be obtained by writing to the Editor of the Bell System 
Technical Journal. 
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capacitor paper, since, within the limits observed in commercial material, 
the chemical and physical tests usually made do not correlate with life. 
Lack of correlation is ascribed to obscure physical factors which have not 
yet been identified. 

Generally, several weeks are required to evaluate a paper by life tests of 
the usual severity. Unfortunately, the duration of these tests is too long for 
quality control of paper. 

The desire for a life test which requires no more than a day or two for 
evaluation led to the development of a rapid d-c. test. The philosophy of 
rapid life testing is based upon the experimental evidence that the process 
of deterioration under selected temperature and voltage conditions is prin- 
cipally of a chemical nature, and also upon the well-known fact that rates 
of chemical reaction increase exponentially with temperature. 

Life tests on two-layer capacitors conducted at 130°C. provide an ac- 
celeration in deterioration many fold more than that obtained in the lower- 
temperature life tests, and correlate well with these tests. 
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